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GCCPOOOOOC TCATOQQOQC GACACTOCAC CATCAATCAC TOOOCIGIGA 50 

QGAACEACIG ICITCAOaCA. GAAAQOGICr TEAGIMXS^ 100 

TCICGIGCAG CXTaC3^GGAC CXXXXXTOOC QOGAGAOXA TAGIQGTICIG 150 

OQGAACXDQGfT GAGEACACOG GAATIQOCAG GADGACGQQG l OJi ' l ' lUriU 200 

GATAAAOOOG CICAAIQOCT GGAGATITOG QOGflGGOCXiX: GCAAGACIQC 250 

TAQOCXSAGTA GiUi'iUQSIC QOGAAAGQOC TIGIGCTACT GOCIGATAGG 300 

UiULTlUOSA. GIQ00aCX3QG AOJiCiLUlA GADCGIQC2C: CATCAGCACG 350 

i^TOCEAAAC CICAAAGAAA AACCAAAOGT AACACGAAOC 01030X30^. 400 

QGAOC?ICAAG TrOCX3C33GIG GOC33IGAGAT ajriUL?lUGA. Ui ' l ' mJl'lUi ' 450 

TGOOGOSCAG GGGOCXTEAGA TIOGGFIGIQC G0303ACGAG GAAGACno: 500 

GAQOGCTGGC AACCIC3GAGG TAGAOGICAG GCTATCQOCA AGGCAOC3ICG 550 

GOCXDGAGGGC AGGACCTOQG CICAQ00CX3G GTAODCnGG OOOCTCTATC 600 

GCAATCAGGG TT3QGQGrD3G GOG3GA3I3GC TOCIGICIOC CXEnQOGICT 650 

a3G0CTAGCr GGG30QQCAC AGAOCXXXDQG CGEMGTCGC GCAATTIGGG 700 

TAAGGICATC GATACDQCITA GGIQOQGCIT aSODGACCIC AIOGGGTACA 750 

TAOOGCiaJT a3GCX3O0GCr CnGG?»i3G03 CIGOCAGGGC CXin3303CAT 800 

QGCGICQGQG TIUIGGAAGA aBGOJDGAAC TATOCAACAG GGAACCTia: 850 

TOGTIGCTCT TICTCTCAICr TOL'i'iUiUGC Ol'ItX-TCTCT TGOCTCACIG 900 

TCCXXDGCnC Nsocrrpcci^ GIDOGCAATT OGroGOOGCr TTAOCAIGIC 950 

ACCAATCATT QCaTTAACIC GAGTATTGIG TAOGAQQGGG COGAIGQCAT 1000 

CXTTQCACACT COQGQGrGIG TCCCTTOOCjr TOGOGAGQGT AAOGQCTOGAl 1050 

GGIGirOGGT QGOGGIGADC CQCAOGGIGG OCACXZAQGGA aSOCAAACTC 1100 

CXXZACAAOGC AQCTIOGADG TCATAIOGAT ClGL'i'iGiCG GSGCQGCAC 1150 

OCICIGCIOG QCQCICTADG TQQQGGADCT GIGOQQGICT GICITICTTG 1200 

TIGGICAACT GnTAOCTIC TCTOQCAOSC GOCACIGGAC GAQSCAAGAC 1250 

1GCAATTGIT CEATCIATCC OGGQCATAIA A03GGICATC aCAIQGCATC 1300 

GGATATCATC AIGAACIGGT OOOCTAOGGC AGGGITOGIG GTAGCICAGC 1350 

TOCIDQGGAT ODCACAAGOC ATCA3GGACA T3ATOGCIQG TOCTCACIGG 1400 

QGAGICCIGG OQGGGATAQC GTAli'i'iUiUG AIGGnSGGGA ACIQGQOGAA 1450 

OGTOCIGGm GIQCIQCIQC TAmODGGG OGIDGPOOOG GAAADOCACG 1500 

TCADCQQQGG AAAIQQGQQC OQCAOCAOSG CDGGGCTiGi' TQGICIGCIT 1550 

ACACCAG3GG OCAAGGAGAA CATOGAACTG ATCAACACCA. ADGGCAGITG 1600 

QCACATCAAT AGCAOOaOCT TGAATTGCAA TGAAAOOCIT AACACOGQCT 1650 

GGTrAQCAQG QCTCrTCIAT CAACACAAAT TCAACICTIC AGQCiGiCCT 1700 

GAGAOGTIGG GCAQCIGOCG AaaOCTEAOC GAliTi'lUOCC AGQGCIQQGG 1750 

TCCTATCAGT TATGQCAADG GAAQQQGOCT OG?OGAACX3C QOCTACIQCT 1800 

GGCACEACCX: TCCAAGADCT TGIGQCATIG TGQOQQCAAA GAGOGIGIGT 1850 

OGCQOGGTAT ATTGCi'iCAC IXDQCAGOOQC GIGGIGGIGG GAADGACOGA 1900 
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CAQGIOQaOC GCOCCmXTD IQCAAAIGAT ACGGAIGICT 1950 

TCGIOCITAA CAACAOCAQG CCAa33CIGG QCAAnaGfTT aaGflTGEACC 2000 

TOGAIGAACr CAACIQGATT CA0C3W^GIG IQOaGAGOQC OOO L'l ' iUlUi' 2050 

GATCGGAQQG Gia3QCAACA ACACCTiaCT CIGOOOCACT GALl'lGCl'lUC 2100 

QCAAACMXDC QGAAQOCACA. TACICTOOCT QaaQCKXX3G TDOCTOGATr 2150 

ACACOCSOjr GCAiaGTOGA. CTAGOOSIM? AQQL'i'i'iUQC ACTA3DCTIG 2200 

TAOCAICAAT 120^0CATAT TCAAAGTtCAG GATCTCACGTO QGAQQQGflDG 2250 

AQCACAGQCr QGAAOaQOOC TOCAACIQGA. OQCXaaOQaGA AOQCIGnSAT 2300 

GaG?O^GGIC OGftGCICAGC OOSnOCIQC TCICCAOCAC 2350 

ACAGIQQCAG GTOCITOCCT G'i'iUi'i'iCAC GADOCTOCXZA G QJI'IUIU CA. 2400 

OCX330CICAT OCAC3CTCCAC CAGAACATIG TQGAOGIGCA GTEACTIGEAG 2450 

QGGSrAQQCT CAAGCMXX3C GIDCIGQQOC ATTAAGIGOG AGTACGTOCT 2500 

TL'iUCiGi'iC CITCIQCriG CAGAO3a30G GGICIGCiUC TOCTIGIOGA. 2550 

TGATCTEACr CATATCQCAA G03GAGQ03G Ci'i'iUGAGAA OCICGTAAm 2600 

CTCAAIQCAG CATCCXnOQC CQQGACX3CAC QCJICriGIGr OCTTQCiaJT 2650 

(Ji'iL'i'iCroC TTiaCGIGGr ATCIGA^^OSG TAQSIQQGrrG GQGQGAQCX3G 2700 

TUTACGCQCTr CIAa93GATC TOOanCIOC TOCIOC'ICCT GCIGQOGTIG 2750 

OCICAGCX3QG CATAOGCACT QGACA03GAG GIGQCX^QCGT OGflGIGQCQG 2800 

OJi'iUi'iUi'i' GTQQGSITAA T3Q03CTGAC TCIGTOQCXIA TATEACAAQC 2850 

GCTA.TAICAG CTOGIQCATC TOGIGQCTTC AGim'i'i'iCT GADGAGAGIA 2900 

GAAQQGCAAC TGCAOGIGIG GCTIOOOQGC CTCAACGIO: GGGQGGGGCG 2950 

OGAIGGOGIC ATCTTACICA. TUiUiUJJAGT ACACOOGAOC CIGGrEATTIG 3000 

ACATCACCAA ACTACIOCIG QCXZALLUi'lOi GAOGOJi'i'iG GAL L ' iUi ' lC AA 3050 

QOCAUi'i'iUC TTAAAGIOQC CTACi'iCGiG OGOGITCAAG GC Ui ' lU l U OG 3100 

GA.TCIQGGCG CTAGOGOQGA AGAIAGGGGG Ji^GGrDGATTAC GTOCAAA213G 3150 

OCATCATCAA GITAQGaaOG CITACIQGCA. OCTAliUiUiA TAADCATCIC 3200 

ACOOGIUTIC GAGACIQ93C GCACAAOSQC CIQOGAGATC TOGOCGIQGC 3250 

TGIQGAAOCA. GiLLJiUi'iLT (DCJCGAATOGA. GAOCAAGCIC ATCAOGTOGG 3300 

QGQCAGAIAC OQOOGOGflQC GGIGACATCA TGAAQSGCIT OOOCGlCiCT 3350 

GCQOGTAQQG GOCAQGAGAT ACIGCITOQG OCACQGGADG GAATO3ICIC 3400 

CAAQOGGIOG AQL?i'iUL'iUG CX300GATCAC GQGGEA030G CAGCAGADGA 3450 

GAQGOCIOCT AGOGiUrAIA. ATCACXZAGOC TGACIGG003 GGACAAAAAC 3500 

CAAGnOGAQG GrGAQGTDGCA. GATOGTOICA ACIQCTAOOC AAACCITOCT 3550 

aGCAAOGTDQC AIXZAAIQQQG TAIQCIGGAC TGTECEAOCAC G3GG00GGAA 3600 

OlAGGAOCAT OGCATCADQC AAGGGIDCTC TGATOCAGAT GTATACCAAT 3650 

GIDGAOCAAG ACUi'iUiGQG CIGG0003CT OCKZAAGGTET OOOQCICATT 3700 

GACAOQCTGT ACCTQa3QCr CCTCGGAOCT TTAOCIOGrCC ACG^OGCAOG 3750 

CQGATCICAT TGCXCTGCGC QGGCGAGGIG AIAGCAOGGG maOL'iUL'i'i' 3800 
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TCaadOCX33C CCAliTiUL'iA CITGAAAGQC 1001033000 GTrOOGClGrr 3850 

GIGGOaaOOG GGACAOOaOO IQQOOCEATT CAQOOCXDaaG GIGIQCACJaC 3900 

GfrOGAGTOGC TAAAQOOGIG GftLTi'llAOXX: CTGIQGAGAA CXTTAQOGACA. 3950 

AOimGAGAT OOaOOGIGTT CA03GACAAC TOCICIOCAC CAQCAGTCQOC 4000 

CXZAGAGCnC CaOOIGOOOC AOCIGCAIQC ia0C3O330C PGOOGmPGA 4050 

QCADCAAGOr COOOOCIGOG TAOaCAGOOC AGOOCTACAA OGTOnOGIG 4100 

CICAAOOOCT CiUi'iUL'iGC AACQCiaaOC TnaOIQCIT iO^TCIOCAA 4150 

GOaOCftTOGG GTIGATOCm AEATCAGGAC aaOQGIGftGA ACAATEftOCA 4200 

CIGQCAQOOC CATCAGGIAC TOC3£3CTAPG QGAAGrnCCT TQOOOAOaOC 4250 

Q3GIQCICAG GAOJiUL'i'iA TCACAaS^MA ATTTGIGACG AGIQOCACIC 4300 

CACX3GAIt30C ACATOCATCT T3Q3CATO3G CA L -' lUiULTl ' GACJCA^^OSG 4350 

AGACIQCX3GG aOOGAGACIG GTIGIQCIOG GCACIQCTAC CXXTDGOOOQC 4400 

TOOOIGACTG TGIDOCATCC TAACAIO^G G^^GGflTGCIC TGIOGADGAC 4450 

03GAGAGATC COnTTTAOG QCAJ^GOCTAT OOODCTOGAG GIGATCAAGG 4500 

G3QGAAGACA TCICALLUi'iC TOQCACTCAA AGA^^GAJCTG OGAOG^iGCIC 4550 

GOOQCXSAAGC TOGrGGCAIT GGQCATCAAT QOGGTCGGQCT ACTAODGCm 4600 

TCTTGAOGIG TCIGICATCC OGADCAQGQG OGALLGi'iUiC GICGIGIDGA 4650 

CXIXSAIQCICr CATGACIQGC TTTAOOOOOG ACITGGACIC TCIGAIAGAC 4700 

TGCAACADGT GIGICACICA GACAGIGGAT TIC?^GQCTIG ACQCIACCIT 4750 

TAGCATIGAG ACAACCAOOC TOQCXDa^GGA TGCTGTCIOC AGGACICAAC 4800 

QCQQQQQCAG GACIQQCAQG QQGA^iGOC?^ QCATCTATAG ALL'i'iUiUQCA 4850 

OQQQQQGAGC GCQ0CIC03G CALLUi'iUL^C TOGIDOGTOC TCTGIGAGIG 4900 

CTAIGAOSOG QGCiUiUUi'i' GGTATC^^GCT CAOSaOOOCX: G^^GACEACAG 4950 

TTAGGCIACG AQCXJCACATC AACAOQOOOG QGCnaDOGr GIGO^GGAC 5000 

CATCTIGAAT TnOGGAGGG QLJiUi'i'm^G GOOCICACIC ATATAGAIGC 5050 

OCACmTEA TCOC^OO^ PO^PiS^GTQG QGPGAACnT OCTTAOCIG3 5100 

TAGOGTAOCA AQGCACGGIG TGOGCIJ^GGG CICA^^GOOOC TCOOOCATCG 5150 

IGGGACCAGA TGIQGAJCTG TTIGATOOOC CITAAJSOOCA OOCIGCAIGG 5200 

GQCAACACQC CIGCTATACA GACIGG303C TGnC^GAAT GA^^GTCACOC 5250 

TGAD3CACCC AATCACCAAA TACATCATCA CATOCATOIC GGOGGAOCIG 5300 

GAQGIDGICA 03AQCADCIG GGIGCIOGIT GGOGGOGTDC TOGCIGCTCT 5350 

G3CX]G0GEAT TGOCIGICAA CMGCIGOGT GGICAIAGIG aGCW3GAia3 5400 

'iCi'iUiU03G GAAQOOGGCA ATEATADCIG /O^GGGAGGT TCTCEAOCAG 5450 

GAGTIGGATC AGAIGGAAGA GIGCIC'iCAG C?CITPCCSjr ACATOGftGCA 5500 

AGGGAHISATC CTOOCIGAQC AGITCA^^OCA GA^^OGOGCIC GGQCTDCIGC 5550 

AGADOOCXJDC QOQQCAIQCA GAOGTIATCA CCCCTGCIGT QCAGACCAAC 5600 

TQQCAGAAAC TOGAQGIUIT TTGQQOGAiSG C^CATCIGGA ALL'i'iUATC^ 5650 

1GQGATACAA TACnQQOQG GOCIGICA;^ GCIGOCTOGr AADOQOQQCA 5700 
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TiarriGATT GAiaQcrnr pcmx^iqccg tcadcagooc aceaacxs^ 5750 

QGCCPAMXX: TCCICITCAA CATftTIQQQS QOGrTOXJIGG CIQOOCaQCT 5800 

OQCOQCXDCXDC QGIODOGCm CIQCXJi'i'iUr Q3GnQC1GGC ClSGCTOaOG 5850 

OCX30CATOGG CAGOGnOGA CIG3QGAAGG TOCTOSIQGA CAnCITQCA. 5900 

QOGTAiaaos oaoQCGroac gogaqcicit geagcatica. agaicatcag 5950 

OQGIGAGGIC CDCIDCS^aQG AGGAOCIGGT CAATCIQCIG OOOGOCATOC 6000 

TCTOGOCIGG AGOOCTIGm GTaQGrDGIGG TCIQaOCAQC AAaaCIGOGG 6050 

aOQCAGGriG GQGOQGGGGA QQQGQCAGIG (ZAAIQGATCA. A0CG3CEAAT 6100 

AQOcrroGcx: TCoaGoaaGA Accmurric oqogaggcac taogigoogg 6i50 

AGAGGGATOC AQGGGGOOGC GICACIGOCA TACICAQCAG CTTCACIGEA 6200 

ACrXZAQCTCC TGAGGGGACT GCATCAGIGG ATAAQCICOG AGICTAOCAC 6250 

'nDGAIQCroC GJi'iUL'iUGC TAAQGGACAT CIGQGACIQG ATATOOGAQG 6300 

TGCIGAQCXSA CnTAAGADC IGQCIGAAAG CCMOJrCNT GOCACAACIG 6350 

cxnaoGAnc ccrrroiGic cigocagoqc qogtataogg qqgiuiqqog 6400 

AQGAGAOGGC ATIATOCACA CBDQCIQOCA CIGIGGSGCT GAGA3C2CT3. 6900 

GACATCTGAA AAA03QGA0G A1GAQGATO3 TOGGTOTEAG GAOCTOCAGG 6950 

AACATCIQGA GTGGGAOGIT OOGCATEAAC GOCEACACGA QQQQGOOCTC 6550 

TACTGOQCIT CCIGQGCXI3A. ACEATAACTT OGOQCIGTOG AQQGIGTCIG 6600 

CAGAQGAATA OGIQGAGAIA AGQOGGGTGG GGGACITCCA CTAGGTATCG 6650 

OGTATCACTA CTGACAATCT TAAAIGOOCG TGOCAGAOTGC: CATCX3C00GA 6700 

ALLTi'i'iCACA GAATTQGAOG QGGIQOGOCr ACACAQGnT GOGOOCOCTT 6750 

QCAAQQOCTr QCIGaaOGAG GAQGTATCAT TCAG^^iGfEAQG ACTOCAOGAG 6800 

TACQOQGIGG GGIOGCAATT AOCTIGGGf^ QQOGAACOGG AOGTAGOGGr 6850 

GTIGADGia: ATOCICACIG AIDOCIDCCA TATAACAQCA. GAGGGGGOQG 6900 

GGAGAAGGIT GGGGAGAGGG ICAOOOOCIT CEAIGGQCAG CICCTOGGCT 6950 

AGQCAGCIUT OOGCTCCATC TCICAAGGCA ACnQCAODG CCAACX^^IGA 7000 

CTGOCrTCAC GOOGAGCICA. TAGAGGCTAA CCTOCIGIQG AGGCAQGAGA 7050 

TGGQOGGCAA CATCAOCAGG GTIGAGICAG AGAACAAAGT GGIGATTCTG 7100 

GACIOCnOG ATCOGL'i'iUr GGCAGPGGAG GATC^^GOGGG AGGICIDOGT 7150 

ACXTIGCAGAA ATICIQOGGA AGICIOGGPG ATTCGOOGGG GQQCIGOOOG 7200 

TCIGGGOGOG GCOGGACTAC AAOOQQOOGC T?^CTAGAGAC GIQGAAAAAG 7250 

OCriGACEADG AAOGAQCIGT GGIOCATOGC TGOOGGCTAC CAOCiaCAOG 7300 

GPDQOCTCCr GIGOCTCOGC CIOSGAAAAA GOGEAOGGIG GTOITCACOG 7350 

AATCAACXDCT ATCTACIGOC TIGG0GG?OZ TIGQCACXAA AAiii'lTiUGC 7400 

AGCTCCICAA CnCOQQCAT TAOGQQ03AC AATACGACAA CATCCICIGA 7450 

QcooQcoarr tcigqcigcc qqqqogacic cgaogtigag toctatiuit 7500 

QCAIGQQOQC QCTOGAQaOG GAQOCIGGGG ATOOGGATCT CAQOGAiDGGG 7550 

ICATQGIOGA CX3GICAGEAG TGGGGOOG?^ AOSGAAGATC TOGIGIGCIG 7600 

FIG. 4D 




H77C 



10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567fiqn 

CICAATOICr Tm'ICCIOGA CAQOaaCACr OGICAODOOG TQCXaCIQOQG 7650 

AAGAACAAAA. ACIQCXXSm: AA03CACIGA QCAACIDSIT QCTAOaOCftT 7700 

C2^CAATCIQG TGTCATTDCAC CACITCAOQC AGIQCTIQOC iW^QOZAGAA 7750 

GAAAG?ICACA. TTIGACAGAC IQCAAGi'iUr QGACAQOCAT TACXilAaQACG 7800 

TQCICAAGGA GSICAAAQCA QOaQOGflCAA AAGTGAAGQC TAACITOCIA 7850 

TOCJCTAGAQG AAGCTIGCAG 0CTCfta30CX: OCT^AMOCAA 7900 

CJi'i'iUQCTAT QOOQCAAAAG AOJDOCXSnG OCATOOCAGA AAQQaOGTCAG 7950 

CroOGflGIQG AAAGADCTIC IQGAAGftCT^ T3I3^ACAOCA 8000 

ATAGACACm OCATCA3GQC CAAGAADGAG Gi'i'i'iUiUOG TICAQOCIGA. 8050 

GAAQQQQQGT OCTAAGGCAG CTO3ICICAT aJiUi'iUGQC GACCIQQQOG 8100 

TOOQOGTEGIG CJGAGAAGATC GCXTTCEAOG AOGrOGGTEAG CAAGCIOOCC 8150 

CTOQCOSIGA TQGGAAGCIC CEAD3GATIC CAATACICAC CAGGACAGOG 8200 

GGTIGAAnC CIOGflQCAAG QGnOGAAGIC CAAGAAGACC CQGATOGGGT 8250 

TCDGGTCAIGA TACCXDGCIGT TTIGACIOCA CAGICACIGA GAGOGACATC 8300 

C3GrrA03GAQG AGGCAATTm OCAAIGTIGT GACCIG3ACC CaZAAGOOCG 8350 

QGriGGOCATC AAGTGQCICA CIGAGAGGCT TTATCTTQGG OOOOCTCITA 8400 

OCAATICAAG QGGGGAAAAC T303QCTACC GCAGGTCGCXDG OGCX2AQCX3GC 8450 

GTACTGACAA. CEAGCTCIGG TAACAOQCIC ACTIGCTACA TCAAQGQGCG 8500 

GGCAQCXnGr 03AGCXDGCAG GGCICCAGGA CIGCADCATC CIOGTCIGIG 8550 

GGGAOGACTT AGIOGTTATC TCIGAAAGIG a3aQ3Gia::A GGAQGADGCG 8600 

GOGAQCXnGA GAGGCnCAC GGAGQCTATC AOCAGGTACT CXX30CCCCCC 8650 

C:X3G3GACXDOC OCACAACCAG AATACGACIT GGAGCITATA ACAOrAIGCT 8700 

OCIOCAAOGT GTCAGIOSCX: CAOGAOGGOG CTGGAAAGAG GGICTACEAC 8750 

CTEADQOGTG ACQCEACAAC CCQQCTOGOG AGAGQQGOGT GGGAGACAQC 8800 

AAGACACACT OGAGICAATT OCIQQCEAGG CAACATAATC ATCrnQQO: 8850 

OCACACTGTG QQOGAGGATC ATACIGAIGA OQCATnCIT TAGOGICCIC 8900 

AIAGOCAQGG ATCAGCITGA ACAGQCICIT AACIGIGAGA TCEADGG?^ 8950 

CTGCEACIOC ATAGAAOCAC TOGATCEAOC TOCAATCATT CSW^GACICC 9000 

A3TG30CTCAG OGCATTnCA CIDCACAGIT ACICIOCAGG TGAAATCAAT 9050 

AQQGIGGQOG CATOOCICAG AAAACTUGGG GIDQGGCXXT TQCGAQCTIG 9100 

GAGACADCX3G QOODGGAGGG TCOGOGCTAG GLTiUiUiO: AGAGGAGGCA 9150 

GGGCTGQCAT ATCTGQCAAG TAOJiUi'iUA ACTGGGCAGT AAGAACAAAG 9200 

CICAAACTCA CrOCAATAQC GaGQQCiaOC OGGCIQGACT TGrOGGGTIG 9250 

GnCADGQCr GGCEACAGOG QQGGAGACAT TTAnTGACAGC GIGTCICATC 9300 

CCOGGQOCXDG CiUUi'iUiUG TTTDGGCTAC TGCIGCIOQC TGCAGGGGTA 9350 

GGCAICIAOC TOCTCQQCAA QQGATCAAGG TTOGGGTAAA CACICOGGOC 9400 

TCTTAAGCXZA TTTOJiUiTl' TXTTTTTTTT 'i'i'i'i'i'i'i'i'i'i' TITriCITIT 9450 

TITTTTIXrTT TOJi'i'lUL ' iT Ci'i ' i'iTi ' lU : Ti ' iUi ' i'i'riC OCTICTITAA 9500 
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TOsroQCia: atcitaqooc tagicaoqqc taqcigigaa aggtcosiga 9550 

GODQCATCAC TQCAGAGAGT QCIGAIACIG QCXTICICIGC AGAICATCfT 9599 
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MSINEECPi;3?K TKRNINRRPQ DVKFPQQQQI VG3^YLLFBR GPRDSVRMR 
KTSEE^EFG RI^PIPKAER PEEKIWAQPG YEWPLY3SIEG a3'\lAGWLLSP 
EGSRPSWGFT DERRFSRNDG KVIETTCICGF AEOGYIPLV GAPLGGAAEA 
LAH3VFVLED GVNYATCiNLP GCSFSIFLLA LI^CLTVPAS AYQVRNSSGL 
YHVINDCENS SIVYEAADAI mTPGCVPCV RBSSIASEO^ AVTPIVATOD 
GKLPTIQIiRR HIDLLVGSAT DCSALYVGDL OGSVFLVQQL PIFSPREH^ 
TQDCNCSIYP GHTHaiFMAW EMMMNWSPm ALWAQLLRI Pi^aMEMIAG 
AEi&N]GVLAGIA YFa^^GSSMMC VLWr.TJ.FAG VDAEIHVTOG NAGRITftGLV 
GLiTPGAKQN IQLININ3SW HINSrAIJSlCN ESTNIGWLAS LFYl^KFNSS 
QCPEKLASCR RLTDEAQGWG PISYANGSGL DEKPYCWHYP PF5P0GIVPAK 
SV03PVYCFT PSPWVGITD RSGAPIYSWG ANEJniVFVIN NIRPPDSSK^ 
GCIV\MSrSIGF TKVQGAPPCV IGGVGNNTLi. CPICCFERKHP EATYSPCGSG 
FWl'i'PKOyiVD YPYRLJWHYPC TINYTIFKVR MifVQGVEHRL EANHMTPGE 
BCDUEDPTES ELSPLLLSTT QWQVLPCSFT TLPALSIGLI HLH^^JTJT^JQ 
YLYGVGSSIA SWAIKWEYW LIFLLLADAR VCSCEJAWMLL ISQAEAAL^N 
LVXLNAASLA GIH3LVSFLV FPCFAWYLKG FWVPGAVYAL YG^S^ ,T .T J T , 
LALPQRAYAL DTEVAASOGG WLVGUXIALT LSPYYKRYIS mmKQYFL 
TFiVEAQLHVW VPPUSiVFaGR DAVILIMIW HPTLVEDITK TJ.TAIFGPDW 
HjQASIJLKVP YFVFVQGLLR ICALARKEAG OiYVQMAIIK IGALTCTYVY 
NHLTPLFinAlA HNGLERDLAVA VEPWFSFME TECLPIWGAEn? AAOSDIINGL 
PVSARR3QEI LLGPADGMVS KGWRLLAPIT AYTMQQTFiGLL GCIITSLTSl 
DKN3VEGEVQ TJSTPnXJTFL ATCINGVCWT VYHGAGTRIT ASPKGPVIQM 
YTIWDQDLVG WPAPQGSRSL TPCICGSSDL YLVIRHAEVI P^;RRR£33SPG 
SLLSPRPISY LK3SSQGPLL CPAC3CWGLF RAAVCHRGVA KAVDFIR/EN 
DCJTIMRSPVF insrSSPPAVP QSPQVAHim PIGSC3<STE<V PAAYAAQGYK 
VLVLNPSVAA TDGFGAYMSK AHGVDHSURT GVRiTi'iLSP ITYSIYC3<FL 
ADGQCSGGAY DIIICDECHS TDATSIDGIG TVLDQAETAG ARLWIATAT 
PPGSVWSHP NIEEVALSTT GEn:PFYC3<AI PLEVIK3C3RH LIPCHSKKKC 
DELAAKLVAL GINAVAYYPG irVSVIPTSG E^AAA/SITiAL MIGPTC3DFIDS 
VIDCNICVTQ TVDFSLDPTF TIEriTLPQD AVSRIQKFiCSl TC2=0<PGrYR 
FVAPGERP9G MFTDSSVDCBC YDAGCAWYEL TPAETIVRLR AYMSTTPGLPV 
CQEHLEFV\JB3 VFIGLTHim HFLSQIKQSG E3SIFPYLVAYQ AIVCARAQAP 
PPSWD3y&\IKC LIFLFnim PTPLLYRDGA Vl^vEVTLIHP nKYIMICMS 
ADLEWTSIW VLVQGVIAAL AAYCLSIQCV VTVOU^TLSG KPAIIPDEREV 
LYQEFIDEMEE CS^iLPYIEQ C3ynyiLAB2FE<Q KALT^ ,T QTAS RHAEVTTPAV 
QHSIWQKLEVF WAKHM^vZNFTS GIQYLAGLSTr LPGNPAIASL MAFTAAVTSP 
LTIGQTLLFN HjGGWVAAQL AAPGAATAFV GAGLAGAAIG SVGLC2<VLVD 
ILAGYGAGVA GALVAFKIMS GEVPSTEDLV MiPAILSPG ALWGWCAA 
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ILRRHVGPGE GAVQC'^MMRLI AEASROSIHVS PIHWPESEA APF^TTMISS 
LTVIQLLRRL HQWISSBCTr PC9GSWLERDI WDWICEVLSD FEdWIiCAKm 
PQLPGIPFVS CQFGYPGVWR GDGIMHIECH OGAhLL'iUHVK NGIMRIVGPR 
TCRNMASCJnF PIMOTIGPC TPLPAHSIYKF AUWFIVSAEEY" VEIFES/GDFH. 
WSGMnnSIL KCP03IPSPE FETELDGVRL HREAPPCKPL liE^EEVSFFWG 
LHEYPVGSQL PCEPEEO/AV LTaXLTDPSH TTAEAAGRRL ARGSPPSMAS 
SSASQLSAPS UMCISVNHD SEEX^ELIEAN LIM^EM3C3Sr TTFS/ESENKV 
VXLDSFDPLV AEEDEEIEVSV PAEHEKSRR EARALPVWAR FDaSCPPLVET 
WKKPDYEPPV VHGCPLPPER SPPVPPPRKK RTWLTESTL SEALAELATK 
SP3SSSTSGI TCCNnrSSE PAP9QCPPDS EVESYSSMPP LBSEPGDPDSL 
SDGSWSIVSS GAUi'ELWOC SMSYSWIGAL VTPCAAEBQK LPINALSNSL 
LRHHNLWST TSRSAOQPQK KVit'UHLQVL DSHX^P/Ii^E VKAAASKVKA 
NLLSVEEACS LTPEHSAKSK PGYGAm/PC HARKAVAHIN SVWEraJLSDS 

vrpmrnm knevpcvqpe kqcsrkparli vfpexjsvrvc EKWALmA/s 

KLPLAViyGSS YGFX2YSPQQR VEFLVQASAIKS KKTEMSFSYD TE^CFDSIVTE 
SDIRIEEAIY QOCDLDPQAR VAIKSLTERL YVQGPCINSR GEN03YFPCR 
ASSVLTTSCXS NTLTCYIKAR AACRAAGLQD CIMLVDCXDL WICESAGVQ 
EDAASIiRAFT EAMIRYSAPP GDPPQPEYDL ELITSCSSNV SVAHDGAGKR 
VYYLTERDPIT PLARAAWBTA RHTPVNSWLG NUMFAFTLW ARMimiHFF 
SVLIARDQLE QALNCEIYGA. CYSIEFODLP PIIQRLH3LS AFSLHSYSP3 
ErWRVAACLR KLGVPPLERAW RHF5ARSVRAR LLSF!QC3RAAI a3<YLFTSIWAV 
RTKLKLTPIA AAGE^LDLSO^ FTAGYSQODI YHSVSHARPR WEWCLLLLA 
AGVGIYLLEN R 
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QOCAQGQGCX: TCMI3G999C GACftCTCCftC C3VTC?\ftfICAC TCOOCIGIGA 50 

QGAACTACTG TCTTCPCGCA GAAAQOSICT AQOCAIQaOG T33^GEAIGAG 100 

IGiaJOaCAG OCIDCAQGAC OODCXCTGCX: QOGAGAOXA TAGIQGICIG 150 

a3GAACXI3CT GAGTCACACQG GAATIGOCAG GAOGADOQQG TCCrTTCTIG 200 

G?VICAACJOQG CTCAATQCXIT QGAGATITOG QOGIGOQQGC QQGAGACIQC 250 

GIGriGQGIC GOGAAAGQCC TK^IGSTACT QOCIGATAQG 300 

GTOCITCQGA GIX300CXI3QG AGGIdOGfEA. GAOOGIGCAC CAIGAQCACG 350 

AATOCTAAAC CICAAAGAAA AACCAAAOSr AACACCAACC QQQQCXXIACA 400 

QGAOGICAAG TTOCOQGQQG GIQGICAGAT CXSnOGIGGA GflTTACJCTOr 450 

naOOSCXSCAG OOQCXXrAGG TIQQGIGIQC QOQCJGACrAG G?^AQQCTIDC 500 

GAQGQGfrGGC AADCTCGflQG AAQQOGACAA OCTATOCTAA AQGCICX3CX33 550 

ACQCX^AGQQC AQQQCXITOQG CTCAQCXXI3G GTACQCnOG CCXTTCTATC 600 

GCAATGAGQG CCTOOQGTQG GCAGGA3GGC TCCTGICAOC QCGCXSOCTOC 650 

CXBOCCTAGrr GGQGOCXXAC GGACXXXrX3G OSTAGGTOaC Gn^^ACTIOaG 700 

TAAQGICATC GATAOXTIA CATaDQQCIT CX3Q0GATCIC ATGQQGTACA 750 

TroCX3CIOCT CQGCGCCOCC CIAQQQQQOG CIGQCAGQQC CTIGQCACAC 800 

GGTGTa:x3QG TTCIGGAQGA CQQQGIGAAC TATOCAACAG GGAACnOOC 850 

CQGTIGCICr TIXTICTATCT TCTTCTIGQC TCDSCIGTOC TUi'i'iGACCA 900 

TCCCAGCnC CX3CriATGAA GIGOGCAAQG TGIODQQGAT ATACCATCTC 950 

ACGAACXSACr GC7IOCAACIC AAQCATIGIG TATGAQQCAG CX3GAQGIGAT 1000 

CATCCATACr CCQ33JV30G TCCQCIGIGT ICAQGAQQC?! AACAQCIOQC 1050 

GTTGCroQGrr AGQGCICACT CCCAaSCTQG a3QCCAGGAA TOQCAGCX3IC 1100 

CCXIACTAQGA CAATACX3AC3G CCACX?IOGAC TTGCTOGTIG QGACQQCIQC 1150 

TTKJIOOICC QCTATCTACG TGaOOGATCT Cn3CX3GAICr ATTTTDCTIDG 1200 

TCTCCCAGCr GnCAOCnC TGGCCIOQOC GQCA1GAGAC AGraCAQGAC 1250 

TGCAACIGCr CAATCTATQC a3Q0CAn?rA TCAQCTCACX: GCAroQCTIG 1300 

QGATATCATG ATGAACIGGT CACCIACAAC AQCXXTAGIG GIUnDQCAGT 1350 

TCCroOQGAT CCXIACAAQCr GIDGIQGACA TGGIQQOQQG QQQQCACIQG 1400 

QGAGICCIQG CXSOGQCITGC CTACTATIDC ATOCTAQQGA ACIQQQCTAA 1450 

QGnCIGATT Gia3CX3CTAC TCTTIQOCXaG CXJCTGAQQQG GAGAOQCACA 1500 

QGAOQaOGAG QCT3aaCX3QC CACAOCACCT QOQQGriCAC GIDOJiTi'iC 1550 

TCATCroOQG CCTCTCAG?^ AATCCAQCIT GIGAATAOCA AQOaCAQCIG 1600 

GCACATCAAC AGGACTGOCX: TAAATT3CAA TCACIOXTC CAAACroOGn? 1650 

TUmGOOGC QCIUi'i'i'iAC GCACACAAGT TCAACTCGIC CXBQGIGQGOG 1700 

G.^iGaXA'IGG CCAGCTGCXDG CXDQCATIGAC TQGITOQOCX: AQQQGIGQQG 1750 

CCCCATCACC TATACIAAQC CIAACAQCIC QGATCAGAQG CCTTATIQCr 1800 

QGCATTAOSC GCXTIDGACXXS TCT3GIGTDG TACXTGOGTC QCAQGIGIGT 1850 

GGICCAGIGT ATT^nTCAC CCXAAQCXCT GnUIQGIQG QGADCADOGA 1900 
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TCGiraDOCT GTCCCmDJr ATAOCIGaOG QGftS^ATCftG ACAGA03IGA 1950 

T3CIQCTCAA CAACAGQOGr Oa3QCACAAG GCAACIQGIT 03GCIGEACA 2000 

TOGAIGAATA. GTACIGQCJET CACI3^AGADG TQCX3GAQGIC COOOJiUllAA 2050 

GATCGQQQQG GTCQGEAAOT GCACXTTIGAT CIGOCOCACG GACIGC'l'iUC 2100 

QGAAGCADOC CGAQQCTACT TACACAAAAT GIQQCia3QG GOOCIQGTIG 2150 

ACAOCTAGCT GOC33^GIAGA CIMCXSm^ AQC3L'i'i'iUQC ACEACXXXTCG 2200 

CACiciCAAT TrrracATCT TmM3Grmj GftiiGimuiG QGOoaasioG 2250 

AQCACAQQCr CAAIQOOQCA. TGCAATIGGA CIOSAQGAGA QOQCIGmAC 2300 

TIQGAQGACA GQGATAGC3IC AGAAC7ICAGC OCXXTTQCIQC TCICTAC?^ 2350 

AGAGIQQCAG ATACIGOXT GIQCTTICAC CACQCTAOOG QCTITATOCA 2400 

CIGGITIGAT OCMCIOCAT CAGAACATCG IQGACGIQCA ATAOJiUJiAC 2450 

GCjnjCAQQCT CAGOGITIGr CiUL'i'i'iUCA ATCAAMTOQG i^^^IACATCCr 2500 

Ui ' lULTri'i C Ci'iCICCIQG CAGADQOQQG QGIGIGIQGC TQUi'iUiUGA 2550 

TGATOCroCr GATAQOQCAG QCIGAQQCX33 CXTITAGAGAA. CTIQGIQGIC 2600 

CICAAIQ03G OGiraJCQQC QQGAGCXXIAT GSimTCICT QL'i'i'iUi'iGT 2650 

Gl ' ICTI CIQC QCCQOCIQGr ACATTAAGQG CAQQCTIQQCr aCTGQQCaOQG 2700 

CCTA-TGCnT 1TAT3GCXJCA. T3QCX:X3CIGC TCXTIGCTOCT ACIQQCX3ITA 2750 

CCACCADGAG CITAOQCXITr OGAOCXaQGAG ATOGCIGCAT OGIGOGQGQG 2800 

TGOOJi'iUi'i' GTAGOICTOG TATIUITGAC CTIGICACXZA TACEACAAAG 2850 

TCTTTCICAC TAGGCICATA T3GTGGITAC AATACTITAT CAOCAGAGCX: 2900 

GAGGCGCACA T3CAAGIUIG GGIOCXXQOC CICAAOmC GGGGAGQQOG 2950 

CGATGOCATC ATCCIOCICA CGIUIGOQGT TCATCCAGAG TEAAi'i'i'i'iG 3000 

ACATCACCAA ACTCCIGCIC GOCATACIOG OXTGCTCAT GGTGCIDCAG 3050 

GCTQGCATAA OGAGAGIGCX: GTACmOGIG OSGQCICAAG QQCICATIDG 3100 

TGCATGCATC TTAGIGCX^AA AAGTOQCXDOG QQGICATTAT GICCAAATOG 3150 

TCTICATCAA GCIGGOQGQG CIGACAGC?EA GCTADGITEA TAACCATCIT 3200 

ACQQCACIGC aOGACIOGQC OCADGOGGQC CTAOGAGADC TIGCX3GIGQC 3250 

QOTAGAGCXX: GTOGIUTICr CXX3QCA1GGA. GACXAAQGIC ATCACCTGQG 3300 

GAQCAGACAC OQCIGCXnGr QQGGACATCA TCTIGGGIUr NXXXJTCTCC 3350 

QOCXDGAAQQG GGAAQGAGAT Ai'iTi'iQQGA (DOQGCIGATA GTCIDGAAQG 3400 

GCAAGGGIQG QG?CIDCITG aSQCCATCAC GQQCTACrGC CAACAAADQC 3450 

GGQQCGTACr TOGITQCATC ATGACTAGGC TCACAQQCXX3 GGACAAGAAC 3500 

CAQGTGGAAG QQGAQGTTCA AGIGGITICr ACQQCAACAC AATCITIOCT 3550 

QGOGACmOC ATCAAD3G0G TGIGCIGGAC TGICTADCAT GGOQCIGGCT 3600 

OGAAGACXXT AGCCX3GI1CCA AAAQGKXAA TCADCCAAAT GTACADCAAT 3650 

GTAGACCIGG ACCIOJDQGG CTQQCAQGOG CQQOCOQQQG CX3QQCrCCAT 3700 

GACACCATGC AGCIGTQQCA GCIOGGACTT TTACnOGIC AC3GAGACATG 3750 

CTGATGICAT TODGGKXDQC CXSGQGAQGCX; ACAQCAGOGG AAGTCTACTC 3800 
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1COOCCN3GC (XXJDCTCCTA. OCIGAAAQQC 1DCI03QC3IG GTOCATIQCT 3850 

TIGCXXnOS GQQCAaSiaG TX33QaGICIT OOQQQCIQCT GIGIQCAOOC 3900 

GaQ3GC3I03C GAAQQOaGIG GACITCATAC COG?nGAGfIC TATOGAAACT 3950 

ACX^^IQQQCT CIO0QC3ICIT CACAGAC?^ ICAACXXXJOC a3QCTGfEAOC 4000 

QCAGACATIC CAAGTOQCAC ATCIQCAOSC TOCTACIQaC AGOaOCAAGA. 4050 

QCACCAAAlSr QCX3QQCIQCG TAOXSCAQCXDC AAQQGIACAA QGIGCIOGIC 4100 

CIGAACXX3CT CXCTIQOOaC CAC3CITAQQG TnOQGaCGT AJaiGlOCAA 4150 

GGCACACXSCT ATCGAOOCIA ACATCAGAAC IGGOGTEAAGG ADCATEAC3CA 4200 

CX9GGCX3GCIC CAflTMCTAC IDCACCISmS GCS^AGITOCT T30CX3Aa3C3r 4250 

GGCIGTICIG GGGOOGOCTA. TGACATCATA. ATATOIGAIG AGIQOCACIC 4300 

AACIGACID3 ACTAOCAICT TOOQCAOrGG CACAGIOCIG GAOC2\AGCGG 4350 

AGA03GCIGG AGOGOQQCIC GIOGIQCIOG CCACDGCTAC AaCIDCX39GA. 4400 

TDQGITADGG T30CACA0QC CAATATO3AG GAAATAQQOC TGIOCAACAA. 4450 

TGGAGAGATC aXTICTATC GCAAAGOCAT OCXXMnGAG GQCATCAAQG 4500 

GGQQGAGGCA TCICALL'i'l'iC 1X3QCATIOCA. AGAAGAAATC 'IGACG2^GCIC 4550 

GCQQCAAAGC TGACAQGCCT CX9GACIGAAC GCIGEAGCAT AITADCX3QGG 4600 

CCTTGATCIG TCOGTDCATAC CGCXrrA.TO3G AGADGTOCTT GID3IQGCAA 4650 

CAGAQQCICT AATGACX3QC?r TTCACXDGGCG Ai'i'i'iGACIC AGIGATCGAC 4700 

TCCAATACAT GIGICACQCA GACAGIOGAC TICAGCnQG AiaX^^CCIT 4750 

CAQCATIGAG AQGA03ACCG TQCXXXiZAAGA aSCQGrDGTDOG GGCTGGCAAC 4800 

GGCX3AGCTAG AACIGGCAQG GCTAGGAGIU GCATCTACAG (Jl ' l ' lUl GACT 4850 

CXAQGAGAAC GQOQCrGQQG CA.TC?nDGAT ICnDQGICC TCICT3AGIG 4900 

CrATCAa30G GQCIGTGCrr GGfTATCAGCr CAOQC3Q03Cr GAGACX:nXI3G 4950 

TTAQGriGOG GGCITAOCrA. AATACAOCAG GGTIGCOCCT CIQQCAGGAC 5000 

CATUTOGACT TCIGQGAGAG CCTCTICACA GOQCICACQC ACATAGAIGC 5050 

CCACITCCrc TOOCAGACIA AACAQQCAQG AGACAACTIT QCTEACCIGG 5100 

TGQCATATCA AGCTACAGIG T3CX30CAGaG CTCAAGCIOC ACCTOCATCG 5150 

TCQGAOCAAA lOTOGAAGIG TCICATAOQG CIGAAAQCTA CACIGCAD3G 5200 

GCCAACAOCX: CIGCIGTAEA GGCEAQGAQC OGIOCAAAAT GAGCJECAIDC 5250 

TCACACACCX: CATAACTAAA TACATCAZOG CATOCAIGIC GQCIGAOCIG 5300 

GAGGIDGfICA CEAQCAQCIG QGIGCiUGJiA GQCGGAGTOC TIGCAGCnT 5350 

GGCQGCATAC TGCCIGAQ3A CAQQCAGIGT QGICATIGIG QQCAQGA3CA 5400 

TCi'iUiODGG GAAQQCAQCT GTGGriDOCX; ACAQOGAAGT CCICTACCAG 5450 

GAGTIOGATG AGATGGAAGA GTGIGQCICA CAACnDCTT ACATCGAQCA 5500 

GQGAATGCAG CrGQODGAGC AATICAAQCA AAAQGOGCIC GGUi'iUi'iGC 5550 

AAACGGQCAC CAAQCAAQQG GAQGCIGCIG CTOOCGIGJT GGAGTOCAAG 5600 

T3QCXSAGCXX TTGAGACCTT CIGGQQGAAG CACATGTOGA ATTICATCAG 5650 

CXSaz^TACAG TAQCTAGCAG GCTTATXXAC ICIGQCIGGA AACXDCXDQOGA 5700 
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TAQCATCATT GAiaOCAnT ACAGCi'iCiA ICACEAGCXDC QCTCACCAOC 5750 

CAAAACACQC TCCIGTITAA CATCTIQQQG QGAIGSSEOG CIGCXXAACr 5800 

QQOTXTCCX: AG0GCIQC3CT CAQLTi'iLGT QGQOXXXSQC ATOQOOGGAG 5850 

OSGL'iGl'iGG CAGCAmSOC CnOGGAAGG TQCIOSIQGA CALLUi'iUQQG 5900 

QQCTATOOGG CAQGOGEAGC OQQOQCACIC GIQOaCTrm AGGICATCAG 5950 

CX3Q0GAQGIG OCXTIOCAOOG AQGAOCIQCT CAACITACIC OCT30CATOC 6000 

TCIUIDCIQG T30CrTOGIC 0103000103 1GIQ03CAQC AATACIQOCT 6050 

aSGCAOOIQG QOXGOGAGA GOaOQCIGIG GAGIGGAIGA ACJOQQCIGAT 6100 

AQOGnOOCr TCOOOGOGm A0CA03ICIC aXTAOQCAC TAIGIOXTC 6150 

AGAQ03A0QC TGCMCACXir GICACICAGA TOJiUiL'JJAG CTTEAOCATC 6200 

ACICAACIGC TGAAQOaOCr CCAGCAGIQO AITAATCAQG iCTOCTCEAC 6250 

GCCA!IGCrCX: QQCI03IGGC TAAGQGAIGT TIQGGAnGG ATATOCAOOG 6300 

TCITGACIGA CriCAAGACC TOGCTCCAGT OCAAACICCT QOQGQGGITA. 6350 

CXDGGGAGnx: CirroCIGTC AT3CX:AACX3C QGGTACAAQG GAGICIGGQG 6400 

QGQGGADQQC A1CA.TOCAAA CCMXTTGCCC ATOCGGAGCA CAGATCQQQG 6450 

GAGATCICAA AAAOQCJnOC ATCAG3ATCG TAGC33CXJ[AG AAQCIGCAQC 6500 

AACAOSIQGC ACX3GAAOGrT CCXXATCAAC QCATACACCA a3GGADCTIG 6550 

CACACCXno: CQQGCX3CXXA ACTATIOCAG QQQOCTAIGG CQGCT3QCIG 6600 

CTCAQGAGIA OOrOGAQGIT ACGCGIGIQG QQGAITKXA CTAOCTGACG 6650 

GGCATGAOCA CTCACAACnr AAAGIQOCCA TGOCAQGnC OQGCXXXX3GA. 6700 

ATICriCACX5 GAQGIQGATG GAGI13QQC?IT GCACAQCTAC QCTCCQQCGr 6750 

GCAAAQCTCr TCIADQGGAG GACXJTCAOOr TOCAGCTOGG QCTGAACCAA 6800 

TACTIGGIOG GGTOGCAQCT CQCA1GOGAG CXXXSAACXDQG ACGTAACAGT 6850 

GCTTACriOC ATGCICACOG ATOQCTOQCA CATTACAGCA GAGADGQCIA 6900 

AGQGTAGGCr GGdAGAQGG TCTOOOQCCT CITIAQQCAG CICATCAQCT 6950 

AGCXIAGnur CTOOGQCnC TnGAAQQQG ACAIGCACTA QOCADCATGA 7000 

CTOCXCTGAC GZDGACCICA TGGAGGQCAA OCTCTTCflGG OQGCAGGAGA 7050 

TGQGOGGAAA CATCACTOOC GIGGAGICAG AGAATAAQGT AGrTAATICIG 7100 

GACTCTTTOG AACX:GCnCA CGCGGAQGQG GAIGAGAQGG AGATATCQGT 7150 

CGOQGQQGAG ATOCTGOGAA AATCCAGGAA GTICQQCICA GGGnOCQCA 7200 

TA1GGQCADG CCQGGACIAC AATCCTQCAC TGCTAGAGIC CIGGAAQGAC 7250 

COQGACIADG TCCCTGCXSGr GGTACACXSGA TGCXDCATIGC CADCTAOCAA 7300 

GGCTGCTOCA ATAOCACCTC CACGGAGAAA GAQGADQGIT GIDCIGACAG 7350 

AATGCAATGT GICTICIGQC TIGGQQGAGC TGQQCACIAA GAOCTTOGGr 7400 

AGCTGOOGAT CETDGGQQQGT TGATAQOGQC ACX3G0GAD0G COCTIDCTGA. 7450 

CCIQQOCTOC GAQGAa3GIG ACAAAGGATC QGACGTIGAG TCGIACTGCT 7500 

CCATGCXirXX CCITGAAGGG GAQQQGQGQG ACOCQGATCT CAGQGADGOG 7550 

TCTIGGTCTA CQGIGAGTGA GGAQQCTAGr GAG3ATGTDG TCIGCIQCTC 7600 

FIG. I4D 




HC-J4 



10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

AATCIOCTAT AOGIQGACftG GOSCXXnGftT CAOaOCAIQC QCIQ03GAGS 7650 

AAAGTAAGCr GQQCATCAAC OaJTDGAGCA ACICITIQCT Q03ICA0CAC 7700 

AACATOGICr Aa3GCACAAC A1CCXDQCAGC CXS^AQOCna: QQCAGAAGAA 7750 

GCJECAOCrrr GACAGATIQC AAGTOCIGGA TGATCATTAC OSOGACXJC?^ 7800 

ICAAQGAGAT Gg^AGQOGAAG QC3GIDCACAG TIJ^AQQCTAA. Q LTlUm iCr 7850 

imO^QGAGG CCIQCAAQCr GAOaOOOOCA CATTCGOaCA AATCCAAATT 7900 

TQQCEAIQGG GCAAAQGADG TOOQGAADCT AIOCAQCftQG QOCXirmAOC 7950 

ACATC03CTC OGIGIQQGAG GAL'i'iUL'iUG AAGACACIGA AACAOGAATT 8000 

GACAQCACX2A ICAIOQCAAA AAGIGAQGIT TICiaOGICC AAQCAGAGAA 8050 

GQGAGQCX:X3C AAQOCAGCIC GQCTIATCGr ATTCOCAGAC CIQQGAGTIC 8100 

GIGEAIGOGA GAAGAIGGOC Ci'i'lACTGAOG TOGTCTOCAC OnTQCICAG 8150 

GCXCTGA.TOG GCroCICAEA CGGATTICAA TACIDOQOCA AGCAGGGOGT 8200 

OGAGrroCTC GIGAATACCr GGAAAICAAA GAAATOGOCT ATOGQCTICr 8250 

CATATCACAC (XX3CIGITIT GACICAA03G 1CACIGAGAG TGACATIDCSr 8300 

GTIGAQGAGT CAATITACCA AlUi'iUiGAC TIQQOCXXIDG AGQCX2^GACA 8350 

GGQCATAAQG TOGCICACAG AQOGGL'i'i'iA CATOQaQQGr COQCIGACTA 8400 

ACTCAAAAGG GCAGAACIGC GGTIAIDGOC QGIGGCXaCQC AAGIQGGGIG 8450 

CTGAOGACTA GCIODGGIAA TAQQCTGACA TCITACrTGA AGG0GACK3C 8500 

AGOCIGTOGA GCIQCAAAGC TOCAQGACIG CAGGATOCIC GIGAADQGAG 8550 

AQGACCTICT OGTTATUTiGr GAAAGOGCm GAAOOCAQGA QGATOQQQCX3 8600 

GCCCTACGAG CCTICAOGGA GQCTATGACr AGGTATIDOG OCCQOQQOGG 8650 

GGA.Tm3CX:C CAAOCAGAAT ACGAOCIQGA GCIGATAACA TCAIGTTOCT 8700 

CCAATGIUIC AGIO30GCAC GAIGCATCTC GCAAAAGQGT ATACIACCIC 8750 

ACXX3GIGACC (XAOCAOQQC CCnGCAOSG GCT3QGIQQG AGACAQCEAG 8800 

ACACACIXXA ATCAACICIT GGCTAGGCAA TATCATCATG TATOQGOCCA 8850 

CCCTAIlGaGC AAQGATCATT CIGATGACIC ACl'i'i'i'iCIC CATCCl'iCiA 8900 

GCTGAAGAGC AACmGAAAA AQOXTOGAT TCICAGAICT AOGQGQCrTC 8950 

CTACroCATT GAGCXZACITG ACX^DCTCA GATCATTCAA C3GACIDCATG 9000 

GIUTEAGGGC ATTTACACrc CACAGnTACT CTQCAGGIGA GATGAATAGG 9050 

GTOQCnCAT GQCICAQGAA ACnOGGGIA CCADQCTIQC GAAOCIOGAG 9100 

ACATO3QG0C AGAAGIGTCC GOQCTAAQCr ACIGIDOCAG GaOGGGAOGG 9150 

CCQCCACITG TQQGAGATAC CTCTITAACr GGGCAGTAAG GAOCAAQCIT 9200 

AAACICACTC CAATCQQGQC CQOGIOGCAG CIGGACTIGT CIQGCIQGIT 9250 

CGIDGCIGGT TACAQCXSGOG GAGACATATA TCACAQOCIG TCIDGIQQQC 9300 

GAOOQQGCIG GTITCQGrilG TGQCIACrOC TACTTIUIGT AQGGGTAGGC 9350 

ATTTAOGIQC TOOOCAACOG ATGAAOGGQG AGCTAADCAC TQCAQGOCIT 9400 

AAGOCATTIC ClUITITTTr 'i'i'i'i'i'l'i'i'iT TITITTTTIT TUi'iTi'i'i'i'i' 9450 

TITCTTTCCr TTGCriUTTr TTrroCTTTC 'i'i'iTiOOCrr CITTAATGGr 9500 
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QQCIDCATCT TAGGCXTEAGTr CACX3QCTAQC IGIGAAAQGT aDSIGAQODG 9550 
CATCACK3CA GAGAGIQCIG ATACTOQCCT CICIQCAGAT CAIGT 9595 
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MSmsIEKPQRK IKRNHsKRPQ UJKFPGO^Ql VQGVYLIiFRR GFKLiSVIRAIR 50 

KPSERSQFPG RRQPIFKARR PBoRAI^\P^PG YEWELYGSNEG LGt^aGEftlLLSP 100 

RGSRPSWGFT DEE?RRSRNLG KVIUilLICC^' AEOG^ELV GAHjQGAAEA 150 

LAHS^/RVLED GVlSnianX3S[LP QCSFSIFLIA LLSCLTTPAS AYEVRNVSGI 200 

YHVUSIDCSNS SIVYEAAEVI MEirPQCVPCV QBGNSSRCWV ALTETLAAFN 250 

ASVPi'i'i'lKR HVDLiLVGIftA PCSAMYVGDL CGSIFLVSQL PIFSPERRHET 300 

VQDOSJCSIYP GHVSSiRMAW TM^MsMSPTT ALWSQLLRI PC^^WEMWAG 350 

YYSMVC3SRaP^ VLIVAIjLf2^ VEXih'lHi'iUR VZOilTSGFT 400 

SLFS9GA9QK IQLVNINSSW HINFCCAIICN DSLgiUttlAA. LF!OVHKFlSISS 450 

GCPEKMASCR PIDWEAQGWS PriYIKFNSS DSE^PYCV^KA EE^POGWPAS 500 

QKXSWYCBT PSPWVGTID RSGVFIYSWG ENEiniWMLLN NIRPPQC2QWF 550 

QCTV^yiGb' TKTOQGPPCN lOGVaSRIU: CPIDCFRKHP EAIYIKGGSG 600 

FWLTPRCLVD YPYRLWEIYPC TLNFSIFKVR MWQGVEHRL NAAOSlWmSE 650 

RCMLEOREeS ELSPLLiLSIT EWQILPCAFT TLPALSTCEjI HLFO^TEVCVQ 700 

YLYGVGSAFV SFAIKWEYXL liFLLLADAR VCACL&\WMLL lAQAEAALEN 750 

LWLNAASVA GAHSZLSFLV FFXIAAWYIKG RLAPGAA:£AF vraAAT Pr J ,t ,t J . 800 

LALPPRAYAL DREMAASOQG AVLVGLVFLT LSPYYKVFLT RLIWWLQYFI 850 

TRAEAHXQVW VPPLNVR3GR DAIZLLTCAV HPELIFDITK LUIAILGPLM 900 

VDQAGnWP YFVRAQCT.tr AOMLVRKWAG C3iYV^3^IVnyiK DGALTCTYVY 950 

NHLTPLRDWA HAGLRDIAVA VEPWFSAME TKVnWGADT AAOaDIIUGL 1000 

R^SARRGKEI FDGPADSLBG QO^RIiAPIT AYSQCflRGVL QCIITSLTGR 1050 

DKIsK2VE3EVQ WSIATQSFL AldNGVOAn? VYHSAGSKTL AGPKGPIT\34 1100 

YINVDLDLVG WQAPPGARSM TPCSOGSSDL YLVIRH?WI PVRRRGDSRG 1150 

SLLSPRR/SY LKGS9QGPLL CPSOiWGVF RAAVCIRO/A KAVDFIPVES 1200 

METIMFISPVF TCNSTPPAVP QTFQJIKHLHA PTGSGKSIKV PAAYAAQGYK 1250 

VLVLNPS\/AA TLGFXSAYMSK AHGIDHSTTRT GVRiTi'iUGS ITYSTYGKFIj 1300 

ADGGCSGGAY DIIICDEJCHS TDSTTILGIG TJUX^^EH^ AKLWIATAT 1350 

PPGSVTVPHP NIEEIGLSNN GEIPFYGKAl PIEAIKQGRH LIPCHSKKKC 1400 

DELAAKLTGL GDSIAVAYYRG Ii:^7SVIPPIG EWWATDAL MIUb'iGUFES 1450 

VUXNVC^/IQ 1VDFSLDPIF TlKi'i'iVPQD AVSRSQRRC31 T30RSGIYR 1500 

FVTPGERPSG MFDSSVLCETT YDAQCAWYEL TPAETSVRLJR AYLNTPGLPV 1550 

CQIHLEFWES VFIGLTHTDA HFXSQCEKQAG ENTPYLVAYQ AIVCARAQAP 1600 

PPSWEJ^yS^C LIRLKFTLHG PnTPLLYRLGA V!3>3EVIL.1HP ITECYIMACMS 1650 

ADLEWTSIW VLVQGVLAAL AAYCLTIGSV VIVGRIILSG fCPAWPEflEV 1700 

LYQETOEMEE CASQLPYIBQ Gt^IABQFKQ KALGLU^EAT KQAEAAAPW 1750 

ESKI/\JRALETF WAKHM^NFIS GIQYLAGLST LPC3SfPAIASL MAFTASITSP 1800 

LTTOOTLLFN ILG3W\7AAQL APPSAASAFV GAGIAGAAVG SIGDSvVLVD 1850 

ILAGYGAGVA QZ^VAFKVMS GEVPSIEDLV MiPAILSPG ALWGWCAA 1900 
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HjRRHVGEPGE gavqwmskli aft^srosihvs pihyvpesea aafiviqilss 

LTTIQIJLKRL HSWINEDCST PC9GSWLREV WDWICIVLTD FKIWU^SKLL 
FPUPGJPFLS QQRGVKGVWR GOSIM^riCP CGAQIAGHVK N3SMEm;GER 

'ICS^7IM^3^F pinaytigpc tpspaenysr adwrwaaeey vevirvgdfh 

YVTGMTinSlV KCPOgVPAPE FFTEVDGVRL HRSdAPACKEL LREUTTFQ^ 
LI^QYLVGSQL PCEPEPDv/IV LTSMLTDPSH ITftEnCAKREL ARGSPPSLAS 
SSASQLSAPS n^ATCTTHHD SPDADT.TKAN IiWE^EM3C2^ ITPVESENKV 
VILDSFEPLH AEr33EREISV AAEHJRKSRK FPSALPIWAR PnVMP PT.T.Rg 
WEODEDYVPFV VH3CPLPPIK APPIPPEE^RK RIWLTEaSIV SSAIAELAIK 
TFX3SSGSSAV D0G3M3VLPD LASE33XKGS EVESYSSMPP LBGEPODPOL 
SDGSWSIVSE EASECWCCS MS^jTIWIGALI TPCAAEESKL PINELSNSLL 
PHHNMVYMT SRSASLiFJQKK VTFTEmSVLD EHYRETiTLKEM KAKASIVKAK 
luLSXEEACKL TPEHSAKSKF GYGAKCVRNL SSRAVNHIRS VWEDTT.KnTE 
TPlUi'i'lMAK SEVPCVQPEK GCSRKPARLIV FPDLGVFVCE KMALYEWST 
LPQAVM3SSY GpiQYSPKQRV EFXA7NIWKSK KCHyGFSYDT RCFIDSIVTES 
DIRVEESIYQ CXXHAPEARQ AIRSLTERLY lOGPLHSISKG Q^IOCJfERCRA 
SGVLTTSQOSI TLTCYUCAIA ACRAAKDQDC IMLVISmXV VICESAGIQE 
DAAALJlAFrE AbOTiYSAPPG DPPQPEYnLE LTTSCSSNVS VAHDASC3<KV 
YYLTERDPITP LARAM^ETAR HTPINSWLOSr IIMYAPIUWA EMTLMIHFFS 
HJLAQEQLEK AmDQIYGAC YSIEPLOLPQ IIEEILHGLSA FTLHSYSPGE 
INRVASCLRK LGVPPLRTWR HRARSVRAKL L9QQGRAATC GRYLITSIWAVR 
TKLKLTPIPA ASQLDL9GWF VAGYSOGDTY HSLSRARPRW FPLCLLLLSV 
GVGIYT iT iFNR 
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#2. Strategy for constructing chimeric clone of HCV (pH77CV-J4) 
which contains the nonstructural region of strain H77 and the 
structural region of strain HC-J4 

^„,__ , delete pCV-H77C 



^ ~ - ►*^*^l^*^**^>**l^■^^****^^■^^^»■A■■.^n^*♦■^»*»^ 
C+E1 +E2+P7 ►iNSa + NS3 ♦ NS4 A* NS4B + NS5A + NS5B ♦♦♦ 



Aaa I f 1 5B> Cla I (7 1 0) 1 47 III (Wunt end; 2851 ) Hind III Afl II 

y-ii/c7 • \ I \ w ▼ /70C0» m^(S.r^\\ iaAni\ 



GCA TAC GCA 

/Vde I • (2763) 



(7862) (9160) (9403) 
gc C ATA T Gt 
gcT ATA TGt 



H7851S ► H9173R(P-M) 

^ T° c 

H9140S(P-M) -)e» ^ H9417R 



PC/7 products ' ► C/a l/yde I Fco 4 7^111 ^t/s/on PCff ° ^ 

H2751S (Clal/Ndel) H2a70R 

xho I (2282) Nde I * (2763) 
PCV-J4L6S D 

J4-2271S J4-2776R(Ndel) 

1. Fragment A, B, C and D ; PGR amplification from pCV-H77C or pCV-J4L6S 

• Fragment A ; additional Cla I site, artificial Nde I site induced by a single mutation 
(C^T at nt 2765 of H77C) and authentic Eco47 III site 

• Fragment B and C ; eliminated Nde I site by a single mutation within the primers 
(C-^T at nt 9158 of H77C) , and fusion PGR with both fragments 

• Fragment D ; artificial Nde I site induced by 2 point mutations within the primer 
(T-^A at nt 2762 and G-^T at nt 2765 of J4L6S) 

2. TA cloning of PGR products 

3. Sequence analysis 

4. Cloning of Fragment A (C/a \-Eco 47III ) and Fragment B/C (H/ndlll-Af/ll ) with correct 
sequence into pGV-H77G 

5. Complete sequence analysis of new cassette vector [pHTTCV] . into which the structural 
regions of different genotypes can be inserted. 

6. Cloning of Fragment-^ifire MXho I (cut out from pGV-J4L6S) and Fragment D (Xho \-Nde I) 
with correct sequence into the new cassette vector ; 3 piece ligation 

7. Complete sequence analysis of 1a+1b chimera [ pH77CV-J41 

8. In vitro transcription (within 24 hours of inoculation) 

9. Percutaneous intrahepatic transfection into chimpanzee 
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pH77CV-J4 Sequence 

QCCAQCXDQCX: TGAIGQaQQC GACACICCAC CATCAATCAC TCCCCIGIGA 50 

QGAACEACTG TCnCAOaCA GAAAGGGICT AGQCATOG03 TTAGfEATCAG 100 

TCriOJroCAG OTIDCS^GGAC 0000^ 150 

a3GAAOCX3Gr GAGTACAOCG GAATIQOC3^G GACGACCaQG TO L ' l ' i ' iUi ' lU 200 

GATCAAOOOG CICAATOOCT GGAGAl'i'iUG GOSIQCXDOOC Q03AC»CIQC 250 

TAOOOGAGIA GiUi'iUQGIC GCJGAAAGaOC ' i ' lUiUCJ-LA CT QGCKSm^ 300 

GiUL'i'iUOGA. GfIGOC3GOa3G AQGTCICXJEA GAOaGTEtSCAC CA3GM3CA03 350 

AALLUL'iAAAC CTCAAAGAAA. AA02AAACGT AACACCAAOC GOOQOOCACA 400 

QGAOSICAAG TIOXGGGOS GIGO'lXliAGAT (JUl'lUJiUA GmftOCIGT 450 

1Q0Cm3CAG GQQOOOCAGG TTGGGflGTOC QOQOGIACrAG GAAGG CriUL' 500 

GAG33Gfia3C AACCTOGTEOG AAGQ03ACAA CXTATOOCAA AQQCTOOOOG 550 

ACaDGAGGGC AQGOQCiaOS CICAG0G03G GTA00Cnt3G CmnCTAIG 600 

QCAAIGAQGG CXTTOOOGTOG GCAGGATOOC TOCIGTCACr 003000010: 650 

OOGOOTAGnT GOOOOOOCAC QGAQ000003 0013^001030 GEAACTIGaG 700 

TAAQGIOATC GATADOOTIA CATOG3QOIT O300GATCIC PaX3333IPCA 750 

TTCOQOrOOr OOOOOOOOOO CTAQOOOaOG OroOCAQQQC OITOOCACAC 800 

QGTCIOOOQG TICB3GAGGA CGaOGTEGAAC TATOCAACAG GGAACTIQOC 850 

OGGITGOICr TIUrOIATCT TDCiL'i'iUGC TCIGOIGIOC TOTTTCACCA 900 

TCOCAGOnC OOOITATGAA GIGCGCAAOG TUTOOGGGAT AIACCATCIC 950 

ADGAAOGACr QCIOCAACIC AAGCATTGIG TATCAGQCAG 03GAD0IGAT 1000 

CATGCATACr OOOOOOIGOG TGOOOIGTGT TCAGGAGQOT AACAGOTOCC 1050 

GTIGOTGOOr AQ03CTCACT OOCADOOIOS OGOOCAGGAA TOOCAQOGIt: 1100 

OCCACIAOGA CAAIADGAOG QCACGTOGAC 'I'iUL'iUGi'iG GGACQQCIGC 1150 

TTICIGOIO: GCrATGTAOG TOGGGGAICT CIQCGGATCT AliTi'iLXTIOS 1200 

TOIDOCAQCr GITCADOnC TOGOOIOQOC GQCAIGAGAC AGIQCSGGAC 1250 

TQCAACIGOr CAATCTATCO OGOOCATCrA. TCAGGIGAOC GCATOQCTIG 1300 

GGAIATCATC AIGAACIQGT CACOTACAAC PmXTAGIG GIGIOOOCT 1350 

TGCroOOGAT OOCACAAGCr GIOGTGGACA TOGIGGOOOG GGOOCACIQG 1400 

GGAGIOOIGG OOQOUL'i'itJC CTACIATIO: A3GGr?03GA ACIGGQOTAA 1450 

GGnCIGATT GIGOOOOTAC TCITIG0O3G CGTTGAOOGG GAGAOOCACA 1500 

OGAOOQGGAG GOTOOOOOQO CACADCACOT OCQQOnCAC GiUUL ' i ' ri ' lU 1550 

ICATCIGGGG OOICICAGAA AATCCAGOTT GIGAA33^CCA AOOOCAGOIG 1600 

GCACATCAAC AGGACIGOOC TAAATIGCAA TGWZTOOOIC CAAACIGOGT 1650 

TCrnOOOOC GCIUITTTAC GCACACAPCT TCAACICGIC OOOGIGOOOG 1700 

GAG0QCATG3 OCAQOIGOOG OOOCATIGAC TGOnOOOOC AGQ30ia03G 1750 

OCCCAICADC TATACTAAQC CTAACAGOIC GGAICAGAQG QCTEATIQCr 1800 
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pH77CV-J4 Sequence 



GQCATTAOQC QGCICGAOQG TGTEQGIOiaG TACXDOQQGTC QCAGGrorGT 1850 

QGTCCAGTEGr Ai'iUi'i'iCJAC CCCP>N3O0Cr Ui'iUiULJiUG GGftOCACOGA. 1900 

TCGirCOGCT GIDQCEAOGT ATAGCTOOOG QGAGAAIGftG ACAGACGTCA. 1950 

TOCTOCTCAA CAACAaaOGT CX:X30CACAAG QCAACiUJiT CQQCIGTACA. 2000 

TX3GA3GAAIA. GrCACTOQGnT CACTAAGA03 TOOOGAGGTCC OOCXJOiUrAA 2050 

CATOOaOGOG GTCOGIZAACC QCALXJi'iGftT CTODCOCACG GACiUJ l ' l UC 2100 

QQAAQCAOaC 03AGQCI3CT TACACAAAAT GIG3CTO3QG QOCXTDSGfTIG 2150 

ACAOCTAQGr QOdWSTAGA. CI3^CaCA33^ AQQL'i'i'iLGC ACTAOOCCIG 2200 

CACICICAAT TTnOCAHCr TTAAGGTETAG GftTCTEATOIG Q3QQGCG1UG 2250 

AGGACftOQCT CAATOOOGCA TQCAAnOGA CTOGftOGAGA GOQCIGEAAC 2300 

TP3GAQGACA QQGAIAGGfIC AGAACICAGC 033CIGCI13C TGICTACAAC 2350 

AGAGIGQCAG ATACIQCXXT UiUL'i'i'iCAC CAOOCTAODS QCnTATOCA. 2400 

CIQGTnGAT CCA.TCIDCAT CAGAACAIOG TOGAOTIQCA AIAOC IU I IA C 2450 

QGiUm33Gr CAQOJiTiUi' CiULTi'iUCA ATCAAATOG3 AGE?«2AroCT 2500 

Ui'iUL'i'i'i'iL' Ci'iUiCCIGG CAGAOSOGOG CGIGrGIQQC T3L'i'iUiUGA 2550 

TGATOCIGCr GATAGOCXHAG GCIGAGGOCX3 CXTEAGAGAA Ci'iUUiUGiL' 2600 

CPCAATOOGG OGIOOGIQGC QGGAGOGCAT GGTATICICr C L ' iTiL ' i ' lUi ' 2650 

Gi'iUi'iCTOC GCOGQCIGGT ACATTAAGOG CAQGCTOGCT OCIQQQQGGG 2700 

OGTAIGCnr TTATOOOGTA TOQQOGCIGC TCCIQCItXT ACIGGGGTIA 2750 

CCADCAOGAG CATATOCACT GGACACGGAG GiaSQOGOGT OGIGTOQOGG 2800 

OGITGriCrr GrOOaGITAA TGGOGCIGAC TCIGTCGOCA. TATTACAAGC 2850 

GCTATAnTGAG CIGGIGCATC TOGIOQCTIC ACTAli'i'i'iUi' GADCAGAGTCA 2900 

GAAQOQCAAC TGCAOGIGiG GGTIOCXDOCX: CTCAAGGICC QQGGGGGGCJG 2950 

OGATGCXDGIC ATCITACICA TGIGIGEAGT ACAOOOGAO: CiUGLftlTIG 3000 

ACATCADCAA ACTACIOCIG GOCALLUi'iUJ G^OiXTTTIG GATIUnCAA 3050 

GQCAGITTGC TTAAAGIDOC CEACTIDGIG OOOGTICAAG GOL'i'iUiUGG 3100 

GATCIGQQOG CTAQQQOGGA AGAIAGOCGG A3GICAITAC GIGCAAA3X3G 3150 

QCAICATCAA GTTAQGGQCG CTEACIQGCA CXTIAliGiGiA TAAOCATCIC 3200 

ACOCCICTTC GAGACIGGGC GCACAAOSQC CTGOGAGATC TGQOOGIGGC 3250 

TGIQGAACCA. GIGGICTICT COOGAATOGA GACCAAGCIC ATCAOGTGGG 3300 

GGGCAGAaS^ CDGCDGOGIGC GGIGACATCA TCAAa3aCIT GOOOGICICT 3350 

QCQOGTAGGG QCCAGGAGAT ACIQCITGGG CXZAGOOGAOG GAA3GGICIC 3400 

CAAGGQGIQG AGGi'iGUiGG CX30GCA3CAC GOOGTADGOC CAGCAGACGA 3450 

GAGGQCrOCr AGOGIGTAIA ATCAJDCAQOC TG?^CTOQQQG GGACAAAAAC 3500 

CAAGnOGAQG GIGAGGIDCA. GATCGIG1CA AL'iGL'mDOC AAADCTIOCT 3550 

GGCAAOGIGC ATCAATGGGG TATGCIGGAC TGICTACCAC GGGOOGGGAA 3600 
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PH77CV-J4 Sequence 



OGAQGAOCAT CGCNK^^CCC AAQ3G?IOCTC TCATOCAGAT GTAIACCAAT 3650 

GTGGACXZAAG AOJi'iUiUQG CIGGQOGQCT CTTCAAaJCT CDOOGCICATT 3700 

GACACQCIUr ACCIG033Cr CCTOaGADCT TTAOCJiaSIC AOGAOaCACG 3750 

ODGAroiCAT TDaDGIQGQC OQaOGAGGTIG AIMCAGQQG TAGQCIQCIT 3800 

TCQGGOQQaC QCALL'i'iUJiA CITGAAAQQC TOCTCQGQQG GTCCX3CIGIT 3850 

GIGOGOOQCG GGACAOOaOS T33Q0CTArr €2^00300003 GIGIQGAOOC 3900 

GIOGAGIGQC TAAAQOOGIG GACTTTAICC CTCIGGAGAA OCTAGGGACA 3950 

ACCAIGAGAT 0CXXX30IGIT CAOGGACAAC TCCICTOCAC CAGCAGIOOC 4000 

OCAGAQCnC CAGGIQQOOC ADCIGCATOC T00CAC0G3C AQGGGTAAGA 4050 

GCACCAAGOr O0000CIQ03 TADQCAGOOO AG300EACAA. GGIGnOGIG 4100 

CICAAOOOCT CiUi'iUL'iGC AAOQOIQQOC TnOGIQCIT ACAIGIOCAA 4150 

QGQOCAIQOG GTIGATCCrA ATATCAQGAC OaOOGIGAGA. ACAATIACCA 4200 

CIQGCAGQOC CATCADGTAC TOCACCEAOG QCAAGTIOCr lOOOGAOSGC 4250 

QGGIQCICAG GAQGIQCITA TGACATAAm ALL'i'iUiUAOG AGIGOCACIC 4300 

CAOSGAIGGC ACATCCATCT TOGGCATOGG CACIGTDCIT GADCAAGCAG 4350 

AGACIQOQQG G303AGACIG Gi'iGiUC'iLG OGACIQCTAC COCTOOGGGC 4400 

TCOGICACIG T3IO0GATCC TAACATOGAG GAGGTIQCIC TGIDOACCAC 4450 

GGGAGAGA.TC OOCTTITADG QCAAGQCTAT QQOOCIOGAG GIGATCAAQG 4500 

QGQGAAGACA TCTCATCTTC TGQCACICAA AiGAAGAACTG OGAOG^^OCIC 4550 

GODGOGAAaC TQGTGGCATr GQGCAICAAT GQOGIGGOCT ACTADOGOGG 4600 

TCITGACGrG TCTGICNICC CGAOCAODGG GGAliUi'iUiC GICGIG1CGA. 4650 

OCGATOCICr CATGACIGGC TTTAOIBGOG ACITOGACIC TGIGAIAGAC 4700 

IGCAACAOOr GrrcrTCACICA GAC?CT03AT TICAGOCTIG ADGCrACXTIT 4750 

TADCATIGAG ACAAOCAOGC TCOOOCAQGA. TOCIUICIOC AGGACICAAC 4800 

GOOGQGGCAG GACTGGCAQG QGGAAGOCAG QCATCIATAG Ali'i'iUiUGCA 4850 

CXDGGGGGAGC QCODCroOGG CATCTIOG?^ TOGIOOGIOC ICIGIGAGIG 4900 

CTATCAOGOG QGtJiUiUL'iT GGTATCAGCT CAaSOQOGQC G?^GACrACAG 4950 

TTAQGCrAaG AGOGTACATC AACAOOOOGG GGLTiODOGr GIGOCAGGAC 5000 

CAICriGAAT TnOGGAQOG GGICnTPOG GGOCICACIC ATAT^^GAIOC 5050 

CCACrmTA TCOCAGACAA AQCAGAGI03 GGAGAAdTT OCrEAOOIQG 5100 

TAGOGTACCA. AQ0CAD03IG TG03Om33G CICAAGGOOG TOOOOCATOG 5150 

TGGGAOCAGA TOTGGAAGIG TTIGAIOOGC CTTAAAOGCA. OOCIDCAnTGG 5200 

QOCAACAOQC CTOCTATACA GACIQGGOGC TCTIC^GAAT GAAGICACQC 5250 

TGAGGCACQC AATCAOCAAA TACATCATCA CATOCATCIC GQOGGAOCIG 5300 

GAGGTOGICA. OGAGGAQCIG GGflGCIOGIT GGOGGOGIOC TGOCIGCICT 5350 

GGOOGGGTAT TGOCIGTCAA CAQQCiGGOr GGICATAGIG GGCAGGAiaS 5400 
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pH77CV-J4 Sequence 



TCTIGIDOOG GAAGCXDQQCA. ATTAIACCIG ACAGQGAGGT TCTCmjCPG 5450 

GAGrnOGAIG AGAIGoAAGA. GTQCICICAG CACTEACOGT iOmDGAGCA 5500 

AQQGA[IGAIG CI03CIGAQC AGTICAAGCA. GAAQQCXCTC QGOCIOCiUC 5550 

AGAOCXaOGTC CXDQCDCAraZA GAGGflTATCA CXDCJCIQCIGT CJCAGACX^AC 5600 

TX3QCAGAAAC TDGAGGICIT TiaOOOGAAG CACATOIOGA AITICftTCAG 5650 

TOOGMSCAA ■mcnaaaQG gojigicaac qciqociqgt AAOcmaocA 5700 

TIQCTICATr GATOGCrnr ACAQCIGOCG TCADCAQOQC ACTAACCACT 5750 

GQGCAAAOaC TXXTCTICAA CATAnGQQG QQCT3QGfIQG CIQOQCAQCT 5800 

GQOcxaaaxx: a^rocxDOcm ciacxrrriGr QOGnacrooc crAociaoos 5850 

CXDQOCATOQG CAGOSITOGA. CIQQQGAAGG TOCTOGflGGA CA LL ' iCl ' lU GA 5900 

QGCTATOQCG OQGQCXSIQQC GGGJOJiUiT GfTAGCATICA AGAICATCAG 5950 

OGGIGAGGIC CXXnOCAOaG AQGACCIQGT CAATCIQCIG CXX:X3C]CATOC 6000 

TCIOQCXITOG AGOCXJi'iUiA GTCQGIGIQG TCIQCQCAGC AATACIGQQC 6050 

OaOCADSriG GC0033Q03A GQGGQCAGIG CAATOGftlGA AD033CmAT 6100 

AGOCITCQOC IDCXDOaaQGA. AOCALLUi'i'iU OCX3CACQCAC TACGIGOCGG 6150 

AGAQOGATOC AGOOGCXX33C GICACIGOCA TACICAQCAG OCTCACiUiA 6200 

ACOCAQCrOC TGAQQQGACr QCATCAGIOG ATAAQCTQQG AGnTCTCACXZAC 6250 

TCCATOCIOC GGTICCIGQC TAAQQGACAT CIGQGACIQG ATAIQQGAGS 6300 

TCCTGAGOGA CnTAAGAOC TOGCTGAA^ CXZAAGCICAT GOCACAACIG 6350 

QCTOQGAITC CCi'i'iGiUiC CTOOCAGOQC GOGriAIAQaG GQCJCCIOQOG 6400 

AGGAGAOSQC ATTAIQCACA CTOaCIQOCA CIGIGGAGCT GAGATCACTG 6450 

GACATCICAA AAAa3QGAOG ATGAGGATCG TOQGICCEAG GADCIQCAQG 6500 

AACATCTOGA GTOOGAQJET COXATIAAC QQCTACACXA. a3QQ0CXDCIG 6550 

TACIOCXDCrr CCIODQQOGA ACTATAAGIT OSOQCTGIGG AGGCJCGICIG 6600 

CAGAOGAATA CXJD3GAGATA AGQOGQGIGG GGGACITOCA CrADGfTATCG 6650 

GGTCATCACrA CIGACAATCT TAAATOCCXDG TOOCAGATCC CATCGCXDOGA 6700 

ALL'i'iTiCACA GAATIQGAOG <3C3GB30CCCT ACAC^^iGGTIT GOQCGCQCTr 6750 

GCAAGOQCrr QCIGCG3GAG GAQCTATCAT TCSCAGIAGG ACIDCAOGftG 6800 

TADQ03GTOG GGrPGQCAATT ACCITQOG?^ OCX3GAAOCGG ADGTAGOOCT 6850 

GnGAOJDGC A3X3CICACIG ATOGCIDOGA TATAAC^OIA G?^OGCQ3CX]G 6900 

GGAGAAGGTT GQQGAGAGQG TCAOQQQCTr CTATOaOCAG CIOCTCX3GCT 6950 

AGOCAQCICT OOSCroCATC TCICAAGGCA ACnOCADCG CXiAACCAIGA 7000 

CTCCCCIGAC GCOGAGCTCA TAGAQGCTAA Ol'iCXlTCIGG AQGCAGGftGA 7050 

TQQGOQQCAA CATCAOCAGG GTIGAGIC?^ AGAACAAAGT GGIGATIUIG 7100 

GACrocriOG ATOOGCi'iUr GGCAGAGG^ GATCAGOGGG AGGICTCCGT 7150 

ACCTOCAGAA AnCTQa3GA AGTECTOGG^ ATIDQOOOGG GOOCTOQOOG 7200 



FIG. I6D 




pH77CV-J4 Sequence 



TCIQQQCX30G QQ03GACTAC AACXXXXXX3C mSEAGAGAC GIGGAAAAAG 7250 

OCTGACTAOG AAOCAOCTCT QGriDCA^^ 7300 

GTCXrCTCCr GlOaCIOCXaC CTOQGAAAAA GOGrCAOQGTC GIOCICAOOS 7350 

AA-TCAACQCT AICEACIQGC TTOGCXDGAQC TIGOCACX^A AA UlTriU QC 7400 

AGCiccTCAA crroaaacAT TAoaaooGftc aatwdgacaa cftrcciciGA. 7450 

QCaDQCXDCrT TCIQQCIQOC OOOOOGACIC OGAOSnGAG lOJJ m 'lUrr 7500 

(XaiQOCm: CniQ G ftQQGG GAG0CTO3QG AiaDOGATCT C3^GaGA03Q3 7550 

TCATOGIOGA OGCSICAGEAG TOOQaaOGAC AOOGAAGATC TOGnX?IQCI G 7600 

CICAATOICr TAnOCTOGA. CAOaoaCACT OGTCAOOaOS TQOQCIGOQG 7650 

AAGAACAAAA. ACIC300CATC AADQCACIGA. GCAAdOGIT QCEACQOCAT 7700 

CACAATCIGG TGI?OTXXAC CACTICAOGC AGIQCTIQOC: AAAQ3CAGAA 7750 

GAAAGflCACA TTIGACAGAC TGCAAGTICr GGACAGOCAT TAC3CAQGACG 7800 

TQCICAAQGA GGICAAAGCA GGGOGGrrCAA AAGTGAAGGC TAAL'i'iUL'iA 7850 

TODCTAGAGG AAQCnOCAG CCIGAOSOOC CCPCP^TTCPG CXIIAAATCCAA. 7900 

Ui'i'iUGCTAT GQQGCAAAAG AGGTCOGTIG (DCATOOCAGA AAOXOGIJiG 7950 

OOCACAIGAA CIODGIGTOG AAAGAOCTIC TOGAAGACAG TUEAACACCA 8000 

ATAGACACTA CXiZAICAIOQC CAAGAACGAG Gi'i'i'iUiUOG TICAGCXriGA 8050 

GAAGGQGQCT OGTAAQQCAG CiOGICICAT OJiUi'iLOOC GAOCIGQQGG 8100 

TGOGQGTGIG QGAGAAGAIG GOCTGrAOG AOGTIGCJITAG CAAQCIDOQC 8150 

CIQQOOGrOGA TGGGAAQCIC CTAOQGATIC CAATACICAC CAGGACAGCG 8200 

GGTIGAATTC CTOGIQCAAG OGnOGAAGTC CAAGAAGACXZ COGATOGQGT 8250 

TCIOGTATCA TACOOQCIUr TTIGACIOGA CAGTCACIGA. GAGOGACATC 8300 

CCTAOGGAGG AGQCAATTTA CCAALLUi'iGi' GACCIQGACX: CQCAAQOOOG 8350 

QGrOGGOCATC AAGTCCXTICA CIGAGAiGacr TEALLUi'iUQG QQCXDCICTEA 8400 

CCAATICAAG GGQQGAAAAC TGOGGCTAOC GCAGGiaOOG OGOGAGOGQC 8450 

GTACIGACAA CTAQCIGTGG TAACAGOnC ACTIGCTACA TCAAGGCXX3G 8500 

GGCAQQCICT OGAGOOGC^G QQCIOZAGGA CIGCADCATC CiCGiGIUIG 8550 

GQGADGACIT AGIOGITATC TGIGAAAGIG GQQ3GGIOGA GGAGGAOaOG 8600 

GOGAGOCriGA. GAQOCTICAC QGAGGCTATC ACXAGGrTACT OOGOCCOOOC 8650 

OGQQGACX3DC OCACAACCAG AATADGACTT OGPGCnATA ACATCAIQCT 8700 

anXDCAACCJr GTICAGTOGOC CACJGADGQCG CTGGAAAGAG QGTCTACTAC 8750 

CTTADCXXJDG AOOCTACAAC C0CXDCIO30G AG^^GQQGCGT GGGAGACAQC 8800 

AAGACACACT OZAGICAArr CCTGQCTPGG CAACATAATC ATCITIODOC 8850 

QCACACiGIG QQOGSGGAIG ATACIGATCA. COCAli'i'iGi'i' TAGOGIGCIC 8900 

ATAQQCAQQG ATCAGCITGA. ACAQGCICIT AACIGIG^O^ TUEACXaGAGC 8950 

CTGCIACIGC ATAGAADCAC TGGATCTACC TCCAATCATT CAAAGACICC 9000 

FIG. I6E 



pH77CV-J4 Sequence 



CQCAl'i'i'iLA CIDCACAGTT ACICTOCAQG IGAAAICAAT 9050 

AQOGroaCQG CAItXJCTCAG AAAACTIQCSG GroOOOOOCT TOOGAQCTIG 9100 

GAGACAO03G GOaDQGAQOG T00QCX3CTAG GJi'iCiUiO: AGAQGAGQCA. 9150 

QGQCIQCIAT ATCfTOQCAAG TACC'iUi'iUA. ACIQQQCftGfr A?^GAACAAAG 9200 

CICAAACICA CIOCAATAQC QQOaQCIQQC CGGCiGGACT TCj l UJULji' m 9250 

GnCAOOQCr QQCTACAQOG GQQGAGACAT T3]ATCACAGC GIGICICATC 9300 

CQQQQOCXXXS CiULJi'lUiUG 'i'i'i'iUQCTAC TOC.'IGCTOQC TQC2^Q0asm 9350 

QGCATCEAO: lOCIQCXDCAA CXXSATCAAQG TIGOaGTEAAA CACIOCXSGOC 9400 

miTAAGOCA Ti'iUL'iUi'iT 'i'i'i'i'i'i'i'i'iT TimTTnT Tl'l ' l ' lumT 9450 

Tirrrricrr locrriocTr crrrrrria: titctittic ccircirrAA 9500 

TOGTEGQCia: ATCTEAGOQC TAGICAaSQC TAQCTGIGAA AQGTCCGI G A 9550 

QCXD3CATCAC TGCAGAGAGT GCIGAIACIG QOCTCTCTOC AGAICATCT 9599 
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H77CV-J4aa Sequence 



10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234Sfi7RQn 

MSINPKPQRK IKFNUSIE^RPQ DVKFKaOQQI VQGVYLLEE^R GEEILGVRAIR 50 

KASEERSQPFG PRQPIEKARR PEGI?M^^AQPG YEWPLYOSEE DGWAQWLLSP 100 

PGSRPSWGPT DPRPRSFNLG KVUJilLTOGF AEOCYIPLV GAPDQGAARA 150 

LAHSVRVIiED GVNYA3X3SEjP QCSFSIFLLA LLSCLTIPAS AYEVRNVSGI 200 

YHVUSDCSNS SIVYEAAEVI MHTPQCVPCV QEr3SISSRCSrt?V ALTPTLAARN 250 

ASVPrnTRR HVDLLVGTCAA FCSAMYVOX 03SIFLVSQL FIFSPRPHET 300 

VQEXNISIYP (SNSSJFMm nylMMNWSPIT ALWSQLLRI PQAWEMWAG 350 

AHWGVLAGLA. YYSMVOSKaZAK VLIVAIM^ VDGEIHITCR VA£3nT9C3T 400 

SLFSSGASQK IQLVNINGSW HINRrALNCN DSDCflGFEAA LF!£AHKF1S1SS 450 

QJPEKMASCR PIDWEAQQWG PITYIKPNSS DQRPYCMI£A PRPOGWPAS 500 

QVCX5PVYCFT PSPWVGTID RSGVPIYSWG ENhlUs/MmSI NIRPPQGJSKAJF 550 

GCIWXINSIGF TE<TCQGPPasr IQGVOSIE^TLI CPiLXJt'KKHP EAIYTKDGSG 600 

FWLTPRCLVD YPYRUWHYPC TDSFSIFKVR Mli^VOGVEHFUli ISIAAOSWIKSE 650 

RCNLEDRDRS ELSPT .T ,T >?IT BWQXLPCAFT TLPALSTCLI HLe3KnVEM2 700 

YLYGVGSAFV SFAIKWEYIL LUT.T.TADAR VCPCLMmJL IAQ?£AALEN 750 

LWLNAASVA GAH3ILSFLV FFCAAWYIKG RLAPGAAYAF YGVWPLLLLL 800 

LALPPRAYM. DrEVAASCX3G WLVGLMALT LSPYYKRYIS WCMWAnjQYFL 850 

TBIVEAQLHVW VPPLjNVRQGR DAVTLIICW HFTLVFOriK LLLAIPGPUW 900 

IljQASm<VP YFVRVQ3LLR ICALARKLAG OiWQyiAIIK LGALTCIYVY 950 

NHLTPLRUm HNGLRDLAVA VEPWFSFME TE<LnV\iGADr AAGGDIIMZ. 1000 

PVSARPGQEI UjGPADGMVS KGWFOILAPIT AYAQQTRGLL GCIITSLTCR 1050 

DKNQVB3EVQ WSTATQIFL ATCINGVCWT VYHGAGTRTI ASPKGPVI(34 1100 

YTNVDQDLVG WPAPQGSRSL TPCTOGSSDL YLVTRHAD^ PVRRRODSRG 1150 

SLLSPRPISY LKGSSQGPLL CPAOiAVGLF RAAVCIRGVA KAVDFIPVEN 1200 

DSrnyiRSPVF TCNSSPPAVP QSPQVAHLm PIGSGKHTKV PAAYAAQ3YK 1250 

VLVmPSVAA TLGPGAYMSK APO/DHMIRT GVRTnTGSP nYSTYGKEL 1300 

ADQQCSQGAY DIIICDEEHS TDATSIDSIG TVLDQAETAG AKLWLATAT 1350 

PPGSVTVSHP NIEEVALSTT GEIPFYGE^AI PLEVIKQGRH LIPCHSKKKC 1400 

DELAAKLVAL GINAVAYYPG LiD^/SVIPTSG DWWSTDAL MiUb'iGUFDS 1450 

VIDCNrCVTQ TVDFSLDPIF TIETTTLPQD AVSRTQRRGR 'CT!£2<PGIYR 1500 

R/APGERPSG MFDSSVDCEr: YDAGCAWYEL TPAEJITVRLR AYMNTPGLJEV 1550 

CQEHLEFV\JB3 VFTGLTHEDA HFLSQTKQSG ENFPYLVAYQ A1VCARAQAP 1600 

PPSWD^^C LIKLKPTLHG PTPLLYRDGA VQSIEVTLTHP ITE<YIMrCMS 1650 

ADLEWTSIW VLVQGVLAAL AAYCLSTQCV VIVGRIVLSG KPAIIPE3EREV 1700 

LYQETOEMEE CSQHLPYIBQ QysyiLAEQFKQ KALGLiQTAS RHAEVITPAV 1750 

CrmWQKLEVF WAKHKE^yJNFIS GIQYLAGLST LPC3SfPAIASL MAFEAAVTSP 1800 

LTIGCfTLLFN HjQGWVAAQL AAPGAATAFV GAGLAGAAIG SVGD3K\7LVD 1850 

ilagygagv;a GALVAFEOMS GEVPSTEDLV NLLPAILSPG ALWGWCAA 1900 
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H77CV-J4aa Sequence 



10 



20 



30 



40 



50 



1234567890 1234567890 1234567890 1234567890 1234567890 
ILRE^HVGPGE GAVQCftMSIRLa: AEASR3SKVS PIHYVPESEA AARVIAILSS 
LTVIQLLFRL H^^SSEnTT PCSGSWLE^I IMDWICEVLSD F1<IWLKAKLM 
PQLPGIPFVS OQPGYHGVWR GDGIMnTOi OGAKTIGHVK ISIGIMRIVGPR 
TGRNMA^SGTF P33SlAYnGPC TPLPAFNYKF ALMRVSAEEY" VEIEHVGDFH 
WSOCTUCi KCPQQIPSFE FFTEUDGVRL HRFAPPCKPL LREhVSt'RVG 
LHEYPVGSQL PCEPEEO/AV LTSMLTDPSH ITAEAAiaRRL ARGSPPSMAS 
SSASQLSAPS LKATCEANHD SECAELIEMSr UJWRQEM3GN riFVESENKV 
VILDSFDFLV AEEDEE^EVSV PAETLE^KSPR FARALPVWMl PD^flSIPPLVET 
V^ZKKPDYEPPV VHXI^LPPPR SPPVPPPE^ RIWLTESTL SEfiLAELAIK 
SPGSSSrSGI TtXNrnSSE PAPSQCPPDS E^/ESYSSMPP LEGEPGDPDL 
SDGSWSTVSS GAEfTECWDC SMSYSWIGAL VTPCAAEBQK LPIISI?ULSNSL 
liy^HNLVYST TSRSAOQFQK KVTFTKLQVL DSH!£QEVLKE VKAAASKVKA 
NLLSVEEACS LTPEHSAKSK PGYGAKEVRC HARKAVMHN SVWE<I3LLEDS 
Vi'PlUi'i'IMA. KNEVPCVQPE KG3?KPARLI VFPDLGVRVC EKMAL"YEWS 
KLPLAVM3SS YGFWSPGQR VEFTLVQAIAZE^ KKTHyiSFSYD TRCFDSTIVTE 
SDIRTEEAIY QGGDIJDPQAR VAIKSLTEFtL YVQGPLINSR GHSDGYPECR 
ASGVLTTSQG NTLTCYZKAR AACERAAG3JQD CIMLVO^CDL WTCESAGVQ 
EDAASLRAFT EAMTRYSAPP GDPPQPEYDL ELITSCSSNV SV2^HDGAGE<R 
VYYUIRDPIT PLARAAWETCA RHTPVNSWDG NUMFAFTLM ARMELMIHFF 
SVLIARDQLiE QALNZEIYGA CYSIEPLDLP PIIQRLmLS AFSIliSYSPG 
EINERVAACLR KDGVPPU^AW RHRARSVRAR LLSRQGRAAI CGKYLFI^V 
RIKLKLTPIA AAGRLJDL9GW FrAGYSQC3DI YHSVSHARPR WFV\JPCLLLLA. 
AGVGIYIiLFN R 
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#1a. 3' Deletion mutants of pCV-H77C 

Sequence of 3' untranslated region of pCV-H77C 
5'UTR I 



i''^3ntsi 81 nts 



( 3' variable region; 43 nts ) 3' variable poly U-UC 3' conserved 

•TGK (Stop codon for polyprotein) region region 

AGGTTGGGGT AAACACTCCG GCCTCTTAAG CCATTTCCTG 

(poly U-UC region; 81 nts) 

TTTTTTTTTT ttTTTTTTTT xTTTTTTTCT xTTTTTTTTT CTTTCCTTTC 

CTTCTTTTTT TCCTTTCTTT TTCCCTTCTT T 

(3' conserved region; 101 nts) 

AATGGTGGCT CCATCTTAGC CCTAGTCACG GCTAGCTGTG AAAGGTCCGT 

GAGCCGCATG ACTGCAGAGA GTGCTGATAC TGGCCTCTCT GCAGATCATG 



#1a -1. pCV-H77C(-98X) ; 3' 98 nucleotides removed from pCV-H77C 

TGAAGGTTGG GGTAAACACT CCGGCCT CTT AAG CCATTTC CTGTTTTTTT 
TTTTTTTTTT xTTTTTTTTT TCTTTTTTTT TTTCTTTCCT TTCCTTCTTT 
TTTTCCTTTC TTTTTCCCTT CTTTAAT 

#1a -2. pCV-H77C(-42X) ; 3' 42 nucleotides removed from pCV-H77C 

TGAAGGTTGG GGTAAACACT CCGGCCT CTT AAG CCATTTC CTGTTTTTTT 

TTTTTTTTTT TTTTTTTTTT TCTTTTTTTT TTTCTTTCCT TTCCTTCTTT 

TTTTCCTTTC TTTTTCCCTT CTTTAATGGT GGCTCCATCT TAGCCCTACT 

CACGGCTAGC TGTGAAAGGT CCGTGAGCCG CAT 

#1a -3. pCV-H77C(X-52) ; All of the 3' UTR sequence, except 3' 49 nucleotides, 
removed from pCV-H77C 

TGAGCCGCAT GACTGCAGAG AGTGCTGATA CTGGCCTCTC TGCAGATCAT 



FIG. I7A 




#1a -4. pCV-H77C(X) ; All of the 3' UTR sequence, except 3' 101 nucleotides, 
removed from pCV-H77C 

TGAA ATGGTG GCTCCATCTT AGCCCTAGTC ACGGCTAGCT GTGAAAGGTC 
CGTGAGCCGC ATGACTGCAG AGAGTGCT6A TACTGGCCTC TCTGCAGATC 
ATGT 

#1a -5. pCV-H77C(+49X) ; The proximal 49 nucleotides of the 3' conserved 
region ( 98 nucleotides ; AAT not included) removed from pCV-H77C 

TGAA GGTTGG GGTAAACACT CCGGCCT CTT AAG CCATTTC CTGTTTTTTT 

TTTTCCTTTC TTTTTCCCTT CTTTAATGCC GCATGACTGC AGAGAGTGCT 
GATACTGGCC TCTCTGCAGA TCATGT 

#1a -6. pCV-H77C(VR-24) ; First 24 nucleotides of the 3' variable region 
removed from pCV-H77C 

TGACTTAAGC CATTTCCTGT tTTTTTTTTT tTTTTTTTTT TTTTTTTCTT 

AATGGTGGCT CCATCTTAGC CCTAGTCACG GCTAGCTGTG AAAGGTCCGT 
GAGCCGCATG ACTGCAGAGA GTGCTGATAC TGGCCTCTCT GCAGATCATG 
T 

#1a -7. pCV-H77C(-U/UC) ; Poly U-UC region removed from pCV-H77C 

TGAAGGTTGG GGTAAACACT CCGGCCT CTT AAG CCATTTC CTGAATGGTG 
GCTCCATCTT AGCCCTAGTC ACGGCTAGCT GTGAAAGGTC CGTGAGCCGC 
ATGACTGCAG AGAGTGCTGA TACTGGCCTC TCTGCAGATC ATGT 
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#1b. Strategy of 3' Deletion mutants 

#1 b -1 . pCV-H77C(-98X) 



3' variable 
region 

I L 



poly U-UC 
region 



Afl II (9403) 



J6SL 



3' conserved 
region 
(98 nts) 



PCR 



Xbal 



1. PCR Amplification 

2. Purification of PCR products 

3. Digestion with -4f/II and Xba I 

4. Cloning of Aflll /Xba I fragment into pCV-H77C 

5. Complete sequence analysis 

6. in vitro transcription (within 24 hours of inoculation) 

7. Percutaneous intrahepatic transfection into chimpanzee ; 11/25/97 and 12/17/97 

8. Result : Negative ( No replication) 



#1b-2. pCV-H77C(-42X) 

3' variable poly U-UC 
region region 



I 



3' conserved 
region 
1 (42 nts) 



Afl II (9403) Nhe I (9530) 

Synthesized Oligonucleotides ^^hh 

A 

Xba I* 

1. Synthesis of oligonucleotides ( sense and anti-sense ) 

2. Hybridization of oligonucleotides 

3. Digestion with Nhe I and Xba I 

4. Cloning of Nhe I /Xba I fragment into pG9-KL26 (3' UTR of H77C) 

5. Sequence analysis 

6. Cloning of 3' UTR ( -42X ) [Afl II /Xba I fragment] into pCV-H77C 

7. Complete sequence analysis 

8. in vitro transcription (within 24 hours of inoculation) 

9. Percutaneous intrahepatic transfection Into chimpanzee (Schedule; 1/22/98, 2/5/98 ) 



FIG. 170 



#1b-3. pCV-H77C(X-52) 



2 



NS5B 



Njde\ (9160) 



region 



poly U-UC 
region 



3' conserved 
region 



Pfu PCR 



Svntheslz ed'onaonuclf>nf !H^<i 
i b 
. Xba V 



Fusion and Extension 



1. Fragment a ; Pfu PCR amplification and purification 

2. Fragment b ; Synthesized oligonucleotides (anti-sense) 

3. Fusion and extension 

4. TA cloning 

5. Sequence analysis 

6. Cloning Nde l-Xba I fragment with correct sequence into pCV-H77C 

7. Complete sequence analysis 

8. fn wfro transcription (within 24 hours of inoculation) 

9. Percutaneous intrahepatic transfection into chimpanzee 



FIG. I7D 



#1b-4. pCV-H77C(X) 



NS5B 



AWel (9160) 



3' variable 
region 



poly U-UC 
region 



3' conserved 
region 

(101 nts) 



Synthesized Oliaon ucfeotides 



Xba V 



^ ^ Fusion and Extension 



1. Fragment a ; Pfu PCR amplification and purification 

2. Fragment c ; Synthesized oligonucleotides (anti-sense) 

3. Fusion and extension 

4. TA cloning 

5. Sequence analysis 

6. Cloning Nde l-Xba t fragment with correct sequence into pCV-H77C 

7. Complete sequence analysis 

8. In wfro transcription (within 24 hours of inoculation) 

9. Percutaneous intrahepatic transfection into chimpanzee 



FIG. I7E 



#1b -5. pCV-H77C(+49X) 

3' variable po'V U-UC 3' conserved 

region region region 

d i Afn\ (9403) Synthesized Olfaonuclentldi^^ 

Pfu PCR \ ' ^ 

► < ^ ® 

I I Xba I* 



Fusion and Extension 



1. Fragment d ; Pfu PCR amplification and purification 

2. Fragment e ; Synthesized oligonucleotides (anti-sense) 

3. Fusion and extension 

4. TA cloning 

5. Sequence analysts 

6. Cloning AflW-Xba I fragment with correct sequence into pCV-H77C 

7. Complete sequence analysis 

8. In vitro transcription (within 24 hours of inoculation) 

9. Percutaneous intrahepatic transfection into chimpanzee 

FIG. I7F 



#1b -6. pCV-H77C(VR-24) 

3' variable poly U-UC 
region region 

TGA|(24nts)| | 



NS5B 



' conserved 
region 



PCR 



Afl II (9403) 



/N<c/el (9160) 



Afl II * 



1. PCR Amplification 

2- Purification of PCR products 

3. Digestion with Nde I and Afl I 

4. Cloning of Nde I /Aflll fragment into pCV-H77C 

5. Complete sequence analysis 

6. in vitro transcription (withiin 24 hours of inoculation) 

7. Percutaneous intrahepatic transfection into chimpanzee 



#1b -7. pCV-H77C(-U/UC) 

3' variable 
region 

I L 



poly U-UC 
region 



3' conserved 
region 

(31nts) I 



>^f/ II (9403) 
Synthesized Oligonucleotides !!! 



Nhe I (9530) 



Aflll 

1. Synthesis of oligonucleotides ( sense and anti-sense ) 

2. Hybridization of oligonucleotides 

3. Digestion with Afl II and Nhe I 

4. Cloning of Afl 11 and Nhe I fragment into pG9-KL26 

5. Sequence analysis 

6. Cloning of 3' UTR ( -poly U-UC ) [Afl II /Xba I fragment] into pCV-H77C 

7. Complete sequence analysis 

8. in vitro transcription (within 24 hours of inoculation) 

9. Percutaneous intrahepatic transfection into chimpanzee 
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